FOREWORD


The Lockheed Missiles and Space Division sponsors a comprehensive program of
general research in connection with its defense contracts. A portion of the
total research effort at LMSD is carried out in Flight Sciences in the fields
of fluid mechanics, mechanics of deformable bodies, flight dynamics, space
mechanics, and mathematics and statistics.

The results obtained from theoretical and experimental studies in Flight
Sciences are felt to be of general interest and have been assembled into a
collection of research reports. These reports comprise a series of papers
which deal with various topics in the fields listed above.

This volume is concerned with mathematics and statistics.

C. T. Yang


FOREWORD 


This article is a reprint of the Technical Document LMSD-285875, bearing the
same title and dated September 1959. It discusses work carried out under the
Lockheed General Research Program.

INTERVAL ANALYSIS I


INTRODUCTION 

Digital computations by computers consist of finite sequences of
pseudo-arithmetic operations. On the other hand, the exact numerical solution
of a mathematical problem, if computable at all, often requires an infinite
sequence of exact arithmetic operations.

The study of approximation by digital computations is the underlying
motivation for the present study. A digital computation and the analysis of
its error as an approximation are usually carried out separately. However, in
the present study an interval arithmetic is devised which forms a basis for a
concomitant analysis of error in a digital computation. In this system,
computations are performed with intervals, and the intervals are so produced
as to contain, by construction, the exact numerical solutions sought. Hence an
approximation and its possible error will be obtained at the same time,
choosing, say, the midpoint of an interval as the approximation.

This report is the first part of our study, in which we first examine some
properties of exact or ideal interval arithmetic. After a preliminary
discussion of the space of intervals (§1) we study addition, multiplication,
subtraction and division of intervals (§§ 2-5). Then we construct arithmetic
functions as compositions of these elementary operations (§6). As one may
expect, arithmetic functions play exactly the same role in interval analysis
as rational functions in real analysis, so that there is a relation between
arithmetic functions and rational functions (§7).

In the present report we apply interval analysis to the study of the following
problems. Let f(x) be a continuous real-valued function defined on an interval
[a,b]. (1) What is the image interval f([a,b])? (2) What is the definite
integral ∫_a^b f(x) dx? When f(x) is a rational function, the approximations
are given in Theorems 1 and 2 (§§ 8, 9). In general, if f(x) is an arbitrary
continuous function, we may still have approximation Theorems 3 and 4 (§ 10),
although they are not as precise as the first two theorems.

In forthcoming reports we shall apply interval analysis to differential
equations and report on results of machine computations using a digital
version of interval arithmetic modified to enable the computations to be
carried out with pseudo-arithmetic operations. (See also LMSD-48421,
"Automatic Error Analysis in Digital Computation," by R. E. Moore.)

1. PRELIMINARIES


Throughout the whole study R denotes the real line. Whenever a and b are real
numbers with a ≤ b, [a,b] denotes the subset of R consisting of all the real
numbers x with a ≤ x ≤ b. In symbols,

    [a,b] = {x ∈ R | a ≤ x ≤ b}.

Let ℐ be the set of all such sets [a,b]. That means,

    ℐ = {[a,b] | a,b ∈ R and a ≤ b}.

Then we have natural functions

    ρ : R → ℐ,
    α : ℐ → R,
    β : ℐ → R,
    γ : ℐ → R,
    σ : ℐ → R,

defined by

    ρ(x) = [x,x],
    α([a,b]) = a,
    β([a,b]) = b,
    γ([a,b]) = max(|a|, |b|),
    σ([a,b]) = b - a

respectively, where x ∈ R and [a,b] ∈ ℐ.

Whenever x,x' ∈ R, we let

    p(x,x') = |x - x'|.

Then p is a metric on R; that means, p has the following properties.

(1) Whenever x,x' ∈ R, p(x,x') = 0 if and only if x = x'.

(2) Whenever x,x' ∈ R, p(x,x') = p(x',x).

(3) Whenever x,x',x" ∈ R, p(x,x') + p(x',x") ≥ p(x,x").

Whenever A,A' ∈ ℐ we let

    P(A,A') = max{p(α(A), α(A')), p(β(A), β(A'))}.

Then P has the same properties as those p has on R so that P is a metric on ℐ.

As direct consequences of the definitions of p, P, α, β, γ, σ we have

(1-1) The function ρ : R → ℐ is isometric; that means, for any x,x' ∈ R,

    P(ρ(x), ρ(x')) = p(x,x').

Hence ρ maps R homeomorphically onto ρ(R).

(1-2) Whenever A,A' ∈ ℐ,

    p(α(A), α(A')) ≤ P(A,A').

Hence the function α : ℐ → R is uniformly continuous; that means, for any
ε > 0 there is a δ > 0 such that whenever A,A' ∈ ℐ with P(A,A') < δ, we have
p(α(A), α(A')) < ε.

Since uniformly continuous functions are continuous, it follows that:

(1-3) The function α : ℐ → R is continuous; that means, for any A ∈ ℐ and any
ε > 0 there is a δ > 0 such that whenever A' ∈ ℐ with P(A,A') < δ we have
p(α(A), α(A')) < ε.

Just as (1-2) and (1-3), we have

(1-4) Whenever A,A' ∈ ℐ,

    p(β(A), β(A')) ≤ P(A,A').

Hence the function β : ℐ → R is uniformly continuous and consequently it is
continuous.

Using (1-2), (1-4) and well-known properties of real numbers, we have

(1-5) The functions γ : ℐ → R and σ : ℐ → R are uniformly continuous and hence
they are continuous.

The following will be needed later.

(1-6) Let A,A' ∈ ℐ and let a > 0. Then P(A,A') ≤ a if and only if the
following two conditions hold.

(i) For every x ∈ A there is some x' ∈ A' with p(x,x') ≤ a.

(ii) For every y' ∈ A' there is some y ∈ A with p(y',y) ≤ a.

Whenever I ∈ ℐ we let

    ℐ_I = {A ∈ ℐ | A ⊂ I}.

(1-7) Whenever I ∈ ℐ, ℐ_I is compact; that means, every sequence in ℐ_I
contains a convergent subsequence.

Let ℐ* be the subset of ℐ consisting of all the elements of ℐ not
containing 0.

(1-8) The set ℐ* is open in ℐ; that means, for every A ∈ ℐ* there is a
positive number γ_A such that every A' ∈ ℐ with P(A,A') < γ_A belongs to ℐ*.
In fact, we may choose γ_A = min(|α(A)|, |β(A)|).

2. ADDITION


There is a function

    ⊕ : ℐ × ℐ → ℐ

defined by

    ⊕(A,B) = {x + y | x ∈ A and y ∈ B}
           = [α(A) + α(B), β(A) + β(B)],   A,B ∈ ℐ.

The set ⊕(A,B) is also written A ⊕ B. The function ⊕ is called the addition
on ℐ.

(2-1) The function ρ : R → ℐ preserves the addition; that means, for any
x,y ∈ R,

    ρ(x) ⊕ ρ(y) = ρ(x + y).

Because of (2-1), the addition ⊕ on ℐ may be regarded as an extension of the
addition + on R. This is the reason for calling ⊕ the "addition" on ℐ.

(2-2) The addition ⊕ is commutative; that means, for any A,B ∈ ℐ,

    A ⊕ B = B ⊕ A.

(2-3) The addition ⊕ is associative; that means, for any A,B,C ∈ ℐ,

    A ⊕ (B ⊕ C) = (A ⊕ B) ⊕ C.

(2-4) Whenever (A,B), (A',B') ∈ ℐ × ℐ,

    P(A ⊕ B, A' ⊕ B') ≤ P(A,A') + P(B,B').

Hence the addition ⊕ is uniformly continuous; that means, for any ε > 0 there
is a δ > 0 such that whenever (A,B), (A',B') ∈ ℐ × ℐ with P(A,A') < δ and
P(B,B') < δ, we have P(A ⊕ B, A' ⊕ B') < ε.


Proof. Let

    A = [a,b],  A' = [a',b'],  B = [c,d],  B' = [c',d'].

Then

    P(A ⊕ B, A' ⊕ B') = P([a + c, b + d], [a' + c', b' + d'])
                      = max{p(a + c, a' + c'), p(b + d, b' + d')}
                      ≤ max{p(a,a') + p(c,c'), p(b,b') + p(d,d')}
                      ≤ max{p(a,a'), p(b,b')} + max{p(c,c'), p(d,d')}
                      = P(A,A') + P(B,B').

To prove the uniform continuity we have only to pick δ = ε/2. q.e.d.

Since uniformly continuous functions are continuous, it follows that

(2-5) The addition ⊕ is continuous; that means, for any (A,B) ∈ ℐ × ℐ and any
ε > 0 there is a δ > 0 such that whenever (A',B') ∈ ℐ × ℐ with P(A,A') < δ and
P(B,B') < δ, we have P(A ⊕ B, A' ⊕ B') < ε.
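
As a quick numerical check of (2-4), the following sketch adds intervals endpoint by endpoint and tests the inequality on random integer intervals. The tuple encoding and the helper names are illustrative assumptions, and integers are used so that no rounding enters.

```python
import random

def add(A, B):
    """Interval addition: add the left endpoints and the right endpoints."""
    return (A[0] + B[0], A[1] + B[1])

def P(A, B):
    """Metric on intervals: the larger of the two endpoint distances."""
    return max(abs(A[0] - B[0]), abs(A[1] - B[1]))

def random_interval(rng):
    """A random interval with integer endpoints in [-50, 50]."""
    a, b = sorted(rng.randint(-50, 50) for _ in range(2))
    return (a, b)

rng = random.Random(0)
for _ in range(1000):
    A, A2, B, B2 = (random_interval(rng) for _ in range(4))
    # Inequality (2-4): P(A + B, A' + B') <= P(A, A') + P(B, B')
    assert P(add(A, B), add(A2, B2)) <= P(A, A2) + P(B, B2)

print(add((1, 2), (10, 20)))  # (11, 22)
```

A random test of this kind proves nothing, of course; it only guards against a misreading of the statement.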

3. MULTIPLICATION

Whenever A,B ∈ ℐ we let

    ⊗(A,B) = {xy | x ∈ A and y ∈ B}.

We claim that ⊗(A,B) ∈ ℐ.

Let A = [a,b] and B = [c,d]. Then we have the following cases.

(1) If a ≥ 0 and c ≥ 0, then

    ⊗(A,B) = [ac, bd].

(2) If a ≥ 0 and c < 0 < d, then

    ⊗(A,B) = [bc, bd].

(3) If a ≥ 0 and d ≤ 0, then

    ⊗(A,B) = [bc, ad].

(4) If a < 0 < b and c ≥ 0, then

    ⊗(A,B) = [ad, bd].

(5) If a < 0 < b and c < 0 < d, then

    ⊗(A,B) = [min(bc, ad), max(ac, bd)].

(6) If a < 0 < b and d ≤ 0, then

    ⊗(A,B) = [bc, ac].

(7) If b ≤ 0 and c ≥ 0, then

    ⊗(A,B) = [ad, bc].

(8) If b ≤ 0 and c < 0 < d, then

    ⊗(A,B) = [ad, ac].

(9) If b ≤ 0 and d ≤ 0, then

    ⊗(A,B) = [bd, ac].

Because of this result, we have a function

    ⊗ : ℐ × ℐ → ℐ

called the multiplication on ℐ. Whenever A,B ∈ ℐ, the set ⊗(A,B) is also
written A ⊗ B.
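
The nine cases can be transcribed directly and compared with the shorter rule "take the least and greatest of the four endpoint products". The sketch below does this on a small grid of integer intervals. The function names are illustrative, and the non-strict inequalities (≥ 0, ≤ 0) in the sign tests are an assumption made so that the cases cover every pair of intervals.

```python
from itertools import product

def mul_cases(A, B):
    """Interval product by the nine sign cases for A = [a, b], B = [c, d]."""
    a, b = A
    c, d = B
    if a >= 0:                       # A lies in the non-negative half line
        if c >= 0:
            return (a * c, b * d)    # case (1)
        if d <= 0:
            return (b * c, a * d)    # case (3)
        return (b * c, b * d)        # case (2): c < 0 < d
    if b <= 0:                       # A lies in the non-positive half line
        if c >= 0:
            return (a * d, b * c)    # case (7)
        if d <= 0:
            return (b * d, a * c)    # case (9)
        return (a * d, a * c)        # case (8): c < 0 < d
    # Remaining possibility: a < 0 < b
    if c >= 0:
        return (a * d, b * d)        # case (4)
    if d <= 0:
        return (b * c, a * c)        # case (6)
    return (min(b * c, a * d), max(a * c, b * d))  # case (5)

def mul_minmax(A, B):
    """Interval product as the min and max of the four endpoint products."""
    prods = [x * y for x in A for y in B]
    return (min(prods), max(prods))

# All intervals with integer endpoints in [-3, 3].
grid = [(a, b) for a, b in product(range(-3, 4), repeat=2) if a <= b]
assert all(mul_cases(A, B) == mul_minmax(A, B) for A in grid for B in grid)
print(mul_cases((-2, 3), (-1, 4)))  # (-8, 12)
```

The min/max form is usually the more convenient one in code; the case analysis saves multiplications, which mattered more on the machines of the period.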

(3-1) The function ρ : R → ℐ preserves the multiplication.

Because of (3-1) the multiplication ⊗ on ℐ may be regarded as an extension of
the multiplication on R.

(3-2) The multiplication ⊗ is commutative; that means, for any A,B ∈ ℐ,

    A ⊗ B = B ⊗ A.

(3-3) The multiplication ⊗ is associative; that means, for any A,B,C ∈ ℐ,

    A ⊗ (B ⊗ C) = (A ⊗ B) ⊗ C.

(3-4) For any A,B,C ∈ ℐ,

    A ⊗ (B ⊕ C) ⊂ (A ⊗ B) ⊕ (A ⊗ C),

but both sides may not be equal. Hence the distributive law does not hold.
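
A small instance makes the failure of distributivity in (3-4) concrete. The sketch below, with illustrative helper names and the tuple encoding of intervals assumed throughout these examples, takes A = [1,2], B = [1,1], C = [-1,-1], so that B ⊕ C is the degenerate interval [0,0].

```python
def add(A, B):
    """Interval addition."""
    return (A[0] + B[0], A[1] + B[1])

def mul(A, B):
    """Interval multiplication via the four endpoint products."""
    prods = [x * y for x in A for y in B]
    return (min(prods), max(prods))

def contains(outer, inner):
    """True when inner is a subset of outer."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

A, B, C = (1, 2), (1, 1), (-1, -1)
lhs = mul(A, add(B, C))          # A (x) (B (+) C)
rhs = add(mul(A, B), mul(A, C))  # (A (x) B) (+) (A (x) C)
print(lhs, rhs)                  # (0, 0) (-1, 1)
print(contains(rhs, lhs), lhs == rhs)  # True False
```

The right side is wider because the two occurrences of A are treated as varying independently.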
(3-5) Whenever (A,B), (A',B') ∈ ℐ × ℐ,

    P(A ⊗ B, A' ⊗ B') ≤ γ(B) P(A,A') + γ(A') P(B,B').

Hence the multiplication is continuous.

Proof. Let xy ∈ A ⊗ B, where x ∈ A and y ∈ B. By (1-6), there is some x' ∈ A'
and some y' ∈ B' such that p(x,x') ≤ P(A,A') and p(y,y') ≤ P(B,B').
Therefore x'y' ∈ A' ⊗ B' and

    p(xy, x'y') = |xy - x'y'|
                = |y(x - x') + x'(y - y')|
                ≤ |y|·|x - x'| + |x'|·|y - y'|
                ≤ γ(B) P(A,A') + γ(A') P(B,B').

Let x'y' ∈ A' ⊗ B', where x' ∈ A' and y' ∈ B'. Similarly there is some
xy ∈ A ⊗ B such that x ∈ A, y ∈ B and

    p(x'y', xy) ≤ γ(B) P(A,A') + γ(A') P(B,B').

Making use of (1-6) again, these results imply the first part of (3-5).

To prove the continuity of ⊗, we let (A,B) ∈ ℐ × ℐ and let ε > 0.

Take

    δ = min{ε/(γ(A) + γ(B) + 1), 1}.

Then for any (A',B') ∈ ℐ × ℐ with P(A,A') < δ and P(B,B') < δ we have

    P(A ⊗ B, A' ⊗ B') ≤ γ(B)δ + γ(A')δ < γ(B)δ + (γ(A) + δ)δ
                      ≤ (γ(A) + γ(B) + 1)δ ≤ ε.   q.e.d.

It is not hard to see that the multiplication is not uniformly continuous.
However, since the restriction of a continuous function on a compact set is
uniformly continuous, it follows from (1-7) and (3-5) that

(3-6) Let I and J be fixed elements of ℐ. Then the multiplication

    ⊗ : ℐ_I × ℐ_J → ℐ

is uniformly continuous. In fact, for any A,A' ∈ ℐ_I and B,B' ∈ ℐ_J we have

    P(A ⊗ B, A' ⊗ B') ≤ γ(J) P(A,A') + γ(I) P(B,B').


4. SUBTRACTION

By (3-5) we have

(4-1) Let E be a fixed element of ℐ. Then the function ⊗_E of ℐ into ℐ,
defined by

    ⊗_E(A) = E ⊗ A,   A ∈ ℐ,

is uniformly continuous.

In particular, if E = [-1,-1], we have

(4-2) The function ⊗_[-1,-1] : ℐ → ℐ, defined by

    ⊗_[-1,-1](A) = [-1,-1] ⊗ A,   A ∈ ℐ,

is uniformly continuous.

Whenever A ∈ ℐ, we shall abbreviate ⊗_[-1,-1](A) by -A. Since -(-A) = A,
⊗_[-1,-1] is a homeomorphism.

Combining the addition ⊕ and the function ⊗_[-1,-1] we define the subtraction

    ⊖ : ℐ × ℐ → ℐ

by

    ⊖(A,B) = A ⊕ (-B) = {x - y | x ∈ A and y ∈ B},   A,B ∈ ℐ.

The set ⊖(A,B) is also written A ⊖ B.

From (2-1) and (3-1), it follows

(4-3) The function ρ : R → ℐ preserves the subtraction.

(4-4) Whenever A,B ∈ ℐ,

    (-A) ⊗ B = A ⊗ (-B) = -(A ⊗ B),

    (-A) ⊗ (-B) = A ⊗ B.

From (2-4) and (4-2) it follows

(4-5) The subtraction ⊖ is uniformly continuous and hence is continuous. In
fact, for any (A,B), (A',B') ∈ ℐ × ℐ,

    P(A ⊖ B, A' ⊖ B') ≤ P(A,A') + P(B,B').
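
The construction of subtraction from negation and addition can be sketched as follows (tuple encoding and function names are illustrative assumptions). One point worth seeing in numbers: since x and y range independently in {x - y | x ∈ A and y ∈ B}, the difference A ⊖ A is not the zero interval unless A is degenerate.

```python
def neg(A):
    """-A = [-1, -1] (x) A: negate and swap the endpoints."""
    return (-A[1], -A[0])

def add(A, B):
    """Interval addition."""
    return (A[0] + B[0], A[1] + B[1])

def sub(A, B):
    """A (-) B defined as A (+) (-B), that is [a - d, b - c]."""
    return add(A, neg(B))

A = (1, 3)
print(neg(neg(A)))       # (1, 3): negation is its own inverse
print(sub(A, (0, 1)))    # (0, 3)
print(sub(A, A))         # (-2, 2), not (0, 0)
```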


5. DIVISION

For every A = [a,b] ∈ ℐ* (see §1),

    A⁻¹ = {x⁻¹ | x ∈ A} = [b⁻¹, a⁻¹]

is in ℐ*. Hence we have a function τ : ℐ* → ℐ* defined by

    τ(A) = A⁻¹.

(5-1) Whenever J ∈ ℐ*, ℐ_J ⊂ ℐ* and for any A,A' ∈ ℐ_J,

    P(A⁻¹, A'⁻¹) ≤ γ(J⁻¹)² P(A,A').

Hence τ is uniformly continuous on ℐ_J, and consequently it is continuous
on ℐ*.

Proof. Let A,A' ∈ ℐ_J. For any x ∈ A there is, by (1-6), some x' ∈ A' such
that p(x,x') ≤ P(A,A') so that

    p(x⁻¹, x'⁻¹) = |x⁻¹ - x'⁻¹| = |x⁻¹|·|x'⁻¹|·|x - x'|
                 ≤ γ(J⁻¹)² P(A,A').

Similarly, for any y' ∈ A' there is some y ∈ A such that
p(y'⁻¹, y⁻¹) ≤ γ(J⁻¹)² P(A,A'). Hence the first part is proved.

To prove the uniform continuity of τ on ℐ_J, J ∈ ℐ*, we let ε > 0 and take

    δ = ε/γ(J⁻¹)².

It is clear that for any A,A' ∈ ℐ_J with P(A,A') < δ, we have

    P(A⁻¹, A'⁻¹) < γ(J⁻¹)² δ = ε.

To prove the continuity of τ we let A ∈ ℐ* and let ε > 0. Take a J ∈ ℐ* such
that

    α(J) < α(A) ≤ β(A) < β(J).

Then for any A' ∈ ℐ with P(A,A') < min{α(A) - α(J), β(J) - β(A)}, A' ∈ ℐ_J.
Hence the continuity of τ on ℐ_J implies the continuity of τ at A. q.e.d.

Since (A⁻¹)⁻¹ = A, A ∈ ℐ*, τ is a homeomorphism.

Combining the multiplication ⊗ and the function τ we define the division

    ⊘ : ℐ × ℐ* → ℐ

by

    ⊘(A,B) = A ⊗ B⁻¹ = {x/y | x ∈ A and y ∈ B},   A ∈ ℐ and B ∈ ℐ*.

The set ⊘(A,B) is also written A/B.

Since for any real number x ≠ 0,

    ρ(x)⁻¹ = ρ(x⁻¹),

it follows from (3-1) that

(5-2) The function ρ preserves the division ⊘.

From (3-6) and (5-1) it follows

(5-3) Whenever I ∈ ℐ and J ∈ ℐ*,

    ⊘ : ℐ_I × ℐ_J → ℐ

is uniformly continuous. In fact, for any (A,B), (A',B') ∈ ℐ_I × ℐ_J we have

    P(A/B, A'/B') ≤ γ(J⁻¹) P(A,A') + γ(I) γ(J⁻¹)² P(B,B').

Hence ⊘ : ℐ × ℐ* → ℐ is continuous.
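
Division can be sketched in the same way as the other operations: invert the divisor, which is only possible when it does not contain 0, and multiply. The names below are illustrative assumptions; `fractions.Fraction` from the Python standard library is used so that the reciprocals are exact.

```python
from fractions import Fraction

def inv(A):
    """Inverse [1/b, 1/a] of an interval [a, b] that does not contain 0."""
    a, b = A
    if a <= 0 <= b:
        raise ZeroDivisionError("interval contains 0, inverse undefined")
    return (Fraction(1) / b, Fraction(1) / a)

def mul(A, B):
    """Interval multiplication via the four endpoint products."""
    prods = [x * y for x in A for y in B]
    return (min(prods), max(prods))

def div(A, B):
    """A / B defined as A (x) B^(-1)."""
    return mul(A, inv(B))

print(div((1, 2), (4, 8)))       # (Fraction(1, 8), Fraction(1, 2))
print(inv(inv((4, 8))) == (4, 8))  # True: inversion is its own inverse
```

Raising an exception for a divisor containing 0 is one possible design choice; it mirrors the restriction of the second argument to intervals not containing 0.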

6.  ARITHMETIC  FUNCTIONS 


Before giving the definition of an arithmetic function we remark that every
arithmetic function has a domain contained in ℐ, an order which is a
non-negative integer, and a finite number of parameters which are elements of
ℐ. An arithmetic function of order n with parameters A_1, A_2, ..., A_m is
written F^(n)_{A_1 A_2 ... A_m} or simply F. It is a function from its domain
𝒟(F) to ℐ.

An arithmetic function of order n is defined by induction on n. When n = 0,
the number m of parameters is either 0 or 1. In the former case it is the
identity function F^(0) : ℐ → ℐ given by

    F^(0)(X) = X,   X ∈ ℐ.

In the latter case it is the constant function F^(0)_A : ℐ → ℐ given by

    F^(0)_A(X) = A,   X ∈ ℐ,

where A is an arbitrary element of ℐ. Notice that every arithmetic function
of order 0 has ℐ as its domain.

Let n be a positive integer and suppose that arithmetic functions of order
< n have been defined. Then every arithmetic function F^(n)_{A_1 ... A_m} of
order n is defined as follows. There is an arithmetic function
F^(s)_{A_1 ... A_i} of order s, 0 ≤ s ≤ n-1, with parameters A_1, ..., A_i,
0 ≤ i ≤ m, and an arithmetic function F^(n-1-s)_{A_{i+1} ... A_m} of order
n-1-s with parameters A_{i+1}, ..., A_m such that F^(n)_{A_1 ... A_m} is
given by

    F^(n)_{A_1 ... A_m}(X) = F^(s)_{A_1 ... A_i}(X) ∘ F^(n-1-s)_{A_{i+1} ... A_m}(X),

where ∘ is one of ⊕, ⊖, ⊗, ⊘ and X is such an element of ℐ that the right
side is well-defined. It is clear that if 𝒟_1 and 𝒟_2 are the respective
domains of F^(s)_{A_1 ... A_i} and F^(n-1-s)_{A_{i+1} ... A_m}, then the
domain 𝒟 of F^(n)_{A_1 ... A_m} is given by

    𝒟 = {X ∈ 𝒟_1 ∩ 𝒟_2 | F^(n-1-s)_{A_{i+1} ... A_m}(X) ∈ ℐ*}   if ∘ is ⊘,
    𝒟 = 𝒟_1 ∩ 𝒟_2                                             otherwise.

Let F be an arithmetic function of parameters A_1, ..., A_m and domain 𝒟(F).
We may write

    F(A_1, ..., A_m; X)

instead of F(X) and consider F as a function of A_1, ..., A_m and X into ℐ.
Notice that 𝒟(F) depends on A_1, A_2, ..., A_m.

(6-1) If F(B_1, B_2, ..., B_m; Y) is defined, then for any C_1 ∈ ℐ_{B_1},
..., C_m ∈ ℐ_{B_m}, Z ∈ ℐ_Y, F(C_1, C_2, ..., C_m; Z) is defined and is
contained in F(B_1, ..., B_m; Y).

Proof. If F is of order 0, our assertion is trivial. Hence our assertion
holds for arithmetic functions of order 0.

Now we proceed by induction. Let n be a positive integer and assume our
assertion for all arithmetic functions of order < n. By definition every
arithmetic function F of order n with parameters A_1, A_2, ..., A_m is given
by

    F(A_1, A_2, ..., A_m; X) = F_1(A_1, ..., A_i; X) ∘ F_2(A_{i+1}, ..., A_m; X),

where F_1 and F_2 are arithmetic functions of order < n and ∘ is one of
⊕, ⊖, ⊗, ⊘.

If F(B_1, B_2, ..., B_m; Y) is defined, then F_1(B_1, ..., B_i; Y) and
F_2(B_{i+1}, ..., B_m; Y) are defined, and F_2(B_{i+1}, ..., B_m; Y) is in ℐ*
when ∘ is ⊘. By the induction hypothesis, for any C_1 ∈ ℐ_{B_1}, ...,
C_m ∈ ℐ_{B_m}, Z ∈ ℐ_Y, F_1(C_1, ..., C_i; Z) and F_2(C_{i+1}, ..., C_m; Z)
are defined and

    F_1(C_1, ..., C_i; Z) ⊂ F_1(B_1, ..., B_i; Y),

    F_2(C_{i+1}, ..., C_m; Z) ⊂ F_2(B_{i+1}, ..., B_m; Y).

So F_2(C_{i+1}, ..., C_m; Z) ∈ ℐ* when ∘ is ⊘. Hence

    F(C_1, ..., C_m; Z) = F_1(C_1, ..., C_i; Z) ∘ F_2(C_{i+1}, ..., C_m; Z)

is defined and is contained in F(B_1, ..., B_m; Y). q.e.d.

(6-2) If F(B_1, ..., B_m; Y) is defined, then there is a number k > 0 such
that whenever A_1, A_1' ∈ ℐ_{B_1}, ..., A_m, A_m' ∈ ℐ_{B_m}, X, X' ∈ ℐ_Y,

    P(F(A_1, ..., A_m; X), F(A_1', ..., A_m'; X'))
        ≤ k(P(A_1,A_1') + ... + P(A_m,A_m') + P(X,X')).

Hence F is uniformly continuous on ℐ_{B_1} × ... × ℐ_{B_m} × ℐ_Y.

Proof. When F is of order 0 it is evident that the inequality holds with
k = 1. Therefore we may proceed by induction and assume the inequality for
arithmetic functions of order < n.

Every arithmetic function F of order n is given by

    F(A_1, ..., A_m; X) = F_1(A_1, ..., A_i; X) ∘ F_2(A_{i+1}, ..., A_m; X),

where F_1 and F_2 are arithmetic functions of order < n and ∘ is one of
⊕, ⊖, ⊗, ⊘. By the induction hypothesis, there exist positive numbers k_1 and
k_2 such that whenever A_1, A_1' ∈ ℐ_{B_1}, ..., A_m, A_m' ∈ ℐ_{B_m},
X, X' ∈ ℐ_Y, we have

    P(F_1(A_1, ..., A_i; X), F_1(A_1', ..., A_i'; X'))
        ≤ k_1(P(A_1,A_1') + ... + P(A_i,A_i') + P(X,X')),

    P(F_2(A_{i+1}, ..., A_m; X), F_2(A_{i+1}', ..., A_m'; X'))
        ≤ k_2(P(A_{i+1},A_{i+1}') + ... + P(A_m,A_m') + P(X,X')).

By (2-4), (3-6), (4-5), (5-3), there exists a positive number k for which our
desired inequality holds. In fact, we can let

    k = 2(k_1 + k_2)(γ(J⁻¹) + γ(I) γ(J⁻¹)²)   if ∘ is ⊘,
    k = 2(k_1 + k_2)(γ(I) + γ(J) + 1)          otherwise,

where

    I = F_1(B_1, ..., B_i; Y),   J = F_2(B_{i+1}, ..., B_m; Y).

q.e.d.

(6-3) If F(B_1, ..., B_m; Y) is defined, then there is a δ > 0 such that
whenever A_1, ..., A_m, X ∈ ℐ with

    P(A_1,B_1) < δ, ..., P(A_m,B_m) < δ, P(X,Y) < δ,

F(A_1, ..., A_m; X) is defined and is continuous.

Proof. If F is of order 0, then our assertion is trivial. Therefore, we may
proceed by induction and assume our assertion for arithmetic functions of
order < n, n > 0.

Every arithmetic function F of order n is given by

    F(A_1, ..., A_m; X) = F_1(A_1, ..., A_i; X) ∘ F_2(A_{i+1}, ..., A_m; X),

where F_1 and F_2 are arithmetic functions of order < n and ∘ is one of
⊕, ⊖, ⊗, ⊘.

By the induction hypothesis there is a positive number δ such that whenever
A_1, ..., A_m, X ∈ ℐ with P(A_1,B_1) < δ, ..., P(A_m,B_m) < δ, P(X,Y) < δ,
both F_1(A_1, ..., A_i; X) and F_2(A_{i+1}, ..., A_m; X) are defined and
continuous. If ∘ is ⊘, so that F_2(B_{i+1}, ..., B_m; Y) ∈ ℐ*, we may choose
δ so small that F_2(A_{i+1}, ..., A_m; X) ∈ ℐ* as well, by (1-8). Hence
F(A_1, ..., A_m; X) is defined and continuous.

It follows from (6-1), (6-2) and (6-3) that

(6-4) Let F be an arithmetic function with m parameters and let

    U = {(A_1, ..., A_m; X) | F(A_1, ..., A_m; X) defined}.

Then U is open in ℐ^(m+1) and F is a continuous function of U into ℐ.
Moreover, for every (B_1, ..., B_m; Y) ∈ U,
ℐ_{B_1} × ... × ℐ_{B_m} × ℐ_Y ⊂ U and F is uniformly continuous on
ℐ_{B_1} × ... × ℐ_{B_m} × ℐ_Y. Furthermore,
(A_1, ..., A_m; X) ∈ ℐ_{B_1} × ... × ℐ_{B_m} × ℐ_Y implies

    F(A_1, ..., A_m; X) ⊂ F(B_1, ..., B_m; Y).


When we write F(X) in place of F(A_1, ..., A_m; X) it is understood that the
parameters A_1, ..., A_m are fixed. Since

    𝒟(F) = {X ∈ ℐ | F(X) defined},

it follows from (6-4) and (6-2) that

(6-5) For every arithmetic function F, 𝒟(F) is open in ℐ and F : 𝒟(F) → ℐ is
continuous. Let I ∈ 𝒟(F). Then ℐ_I ⊂ 𝒟(F) and X ∈ ℐ_I implies F(X) ⊂ F(I).
Moreover, there is a positive number k such that whenever X,X' ∈ ℐ_I,

    P(F(X), F(X')) ≤ k P(X,X').
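
The inclusion property in (6-1) and (6-5), namely that shrinking the parameters and the argument can only shrink the value, can be observed on a small example. The sketch below is an illustration only: the particular function F(A; X) = (X ⊗ X) ⊖ (A ⊗ X), the tuple encoding and the helper names are all assumptions chosen for the demonstration.

```python
import random

def sub(A, B):
    """Interval subtraction [a - d, b - c]."""
    return (A[0] - B[1], A[1] - B[0])

def mul(A, B):
    """Interval multiplication via the four endpoint products."""
    prods = [x * y for x in A for y in B]
    return (min(prods), max(prods))

def F(A, X):
    """Example arithmetic function with one parameter: (X*X) - (A*X)."""
    return sub(mul(X, X), mul(A, X))

def contains(outer, inner):
    """True when inner is a subset of outer."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

def random_subinterval(rng, B):
    """A random interval with integer endpoints contained in B."""
    lo, hi = sorted(rng.randint(B[0], B[1]) for _ in range(2))
    return (lo, hi)

rng = random.Random(1)
B, Y = (1, 2), (-1, 2)
big = F(B, Y)
for _ in range(200):
    C, Z = random_subinterval(rng, B), random_subinterval(rng, Y)
    assert contains(big, F(C, Z))   # F(C; Z) is contained in F(B; Y)

print(big)                 # (-6, 6)
print(F((1, 1), (0, 1)))   # (-1, 1)
```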

7.  RELATION  BETWEEN  ARITHMETIC  FUNCTIONS  AND  RATIONAL  FUNCTIONS. 


In the construction of arithmetic functions, if we replace ℐ by R and replace
⊕, ⊖, ⊗, ⊘ by corresponding operations on R, then we obtain rational
functions in place of arithmetic functions. Therefore, we can establish a
relation between arithmetic functions and rational functions.

An arithmetic function is called special if all of its parameters belong to
ρ(R).

(7-1) If F is a special arithmetic function of domain 𝒟(F), then ρ⁻¹(𝒟(F)) is
open in R and for every x ∈ ρ⁻¹(𝒟(F)), Fρ(x) ∈ ρ(R). Moreover, there is a
unique rational function f whose domain contains ρ⁻¹(𝒟(F)) such that ρf = Fρ
or f = ρ⁻¹Fρ.

    ρ⁻¹(𝒟(F)) ---f---> R
        |               |
        ρ               ρ
        v               v
      𝒟(F)   ---F---> ℐ

As the converse of (7-1), we have

(7-2) Given any rational function f of domain D(f) there is a special
arithmetic function F of domain 𝒟(F) ⊃ ρ(D(f)) such that f = ρ⁻¹Fρ.

Remark. It is possible to have two distinct special arithmetic functions F_1
and F_2 such that ρ⁻¹F_1ρ = ρ⁻¹F_2ρ. For example,

    F_1(X) = (X ⊗ X) ⊕ X,
    F_2(X) = X ⊗ (X ⊕ ρ(1)),   X ∈ ℐ,

give two arithmetic functions F_1 and F_2 of domain ℐ. It is clear that

    (ρ⁻¹F_1ρ)(x) = x² + x = x(x + 1) = (ρ⁻¹F_2ρ)(x).

Since

    F_1([-1,0]) = [-1,1],   F_2([-1,0]) = [-1,0],

F_1 and F_2 are distinct.

Let F be an arithmetic function with parameters A_1, ..., A_m. Let G be the
arithmetic function with B_1, ..., B_m in place of A_1, ..., A_m
respectively; that means, it is given by

    G(X) = F(B_1, ..., B_m; X).

If B_1 ⊂ A_1, ..., B_m ⊂ A_m, then, by (6-4), 𝒟(F) ⊂ 𝒟(G) and for every
X ∈ 𝒟(F), G(X) ⊂ F(X). Hence the relation that B_1 ⊂ A_1, ..., B_m ⊂ A_m
will be written

    G ⊂ F.

Let f be a rational function and let F be an arithmetic function. If there is
a special arithmetic function G ⊂ F with f = ρ⁻¹Gρ, we say that f is an
associated rational function of F or that F is an associated arithmetic
function of f. In particular, if G = F and then f = ρ⁻¹Fρ, we say that F is
an associated special arithmetic function of f and f is the associated
rational function of F.
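
The Remark of this section, on the two special arithmetic functions F_1(X) = (X ⊗ X) ⊕ X and F_2(X) = X ⊗ (X ⊕ ρ(1)), can be reproduced numerically. The sketch below uses the tuple encoding and helper names assumed in the earlier illustrations.

```python
def add(A, B):
    """Interval addition."""
    return (A[0] + B[0], A[1] + B[1])

def mul(A, B):
    """Interval multiplication via the four endpoint products."""
    prods = [x * y for x in A for y in B]
    return (min(prods), max(prods))

def F1(X):
    """(X (x) X) (+) X, an interval version of x^2 + x."""
    return add(mul(X, X), X)

def F2(X):
    """X (x) (X (+) [1, 1]), an interval version of x(x + 1)."""
    return mul(X, add(X, (1, 1)))

X = (-1, 0)
print(F1(X))                  # (-1, 1)
print(F2(X))                  # (-1, 0)
print(F1((2, 2)), F2((2, 2)))  # (6, 6) (6, 6): they agree on degenerate intervals
```

Both results contain the true image of [-1,0] under x² + x, which is [-1/4, 0]; the two expressions simply overestimate it by different amounts.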


8. FIRST APPROXIMATION THEOREM

Let X ∈ ℐ. By a subdivision of X we mean

    ξ = (ξ_1, ..., ξ_r),   ξ_1, ..., ξ_r ∈ ℐ,

such that

    α(X) = α(ξ_1) < β(ξ_1) = α(ξ_2) < β(ξ_2) = α(ξ_3) <
        ... < β(ξ_{r-1}) = α(ξ_r) < β(ξ_r) = β(X).

For every subdivision ξ = (ξ_1, ..., ξ_r) we let

    σ(ξ) = max{σ(ξ_1), σ(ξ_2), ..., σ(ξ_r)}.


LOCKHEED AIRCRAFT CORPORATION

MISSILES AND SPACE DIVISION


Let f be a rational function of domain D(f); let G be an associated special
arithmetic function of f and let F be an associated arithmetic function of f
with F ⊃ G (see §7).

Let I be an element of ℐ contained in D(f). Then for every x ∈ I,
Gρ(x) = ρf(x) is well-defined. Since, by (6-5), the domain 𝒟(G) of G is open,
there is, for every x ∈ I, a positive number r_x such that whenever Y ∈ ℐ
with P(ρ(x), Y) < r_x, G(Y) is well-defined. Let

    I_x = [x - r_x/2, x + r_x/2].

Then G(I_x) is well-defined.

Since I is compact, there exist a finite number of points of I, say
x_1, x_2, ..., x_t, such that I is contained in the union of the interiors

    Q_i = (x_i - r_{x_i}/2, x_i + r_{x_i}/2)

of I_{x_i}, i = 1, ..., t. We abbreviate I_{x_i} by I_i.

Let B_1, ..., B_m be the parameters of G and let A_1, ..., A_m be the
parameters of F. By definition,

    B_1 ∈ ρ(R), ..., B_m ∈ ρ(R);
    B_1 ⊂ A_1, ..., B_m ⊂ A_m;
    G(Y) = F(B_1, ..., B_m; Y),   Y ∈ 𝒟(G);
    F(Y) = F(A_1, ..., A_m; Y),   Y ∈ 𝒟(F).

By (6-3) there is, for every i = 1, ..., t, a δ_i > 0 such that whenever
σ(A_1) < δ_i, ..., σ(A_m) < δ_i, F(I_i) is defined. Let

    δ = min{δ_1, ..., δ_t}.

Then whenever σ(A_1) < δ, ..., σ(A_m) < δ, F(I_i) is defined for all i.

By a well-known theorem on compact metric spaces there exists a δ' > 0 such
that whenever Y ∈ ℐ_I with σ(Y) < δ', Y is contained in one of I_1, ..., I_t
so that F(Y) is defined by (6-5).

Let X ∈ ℐ_I and let ξ = (ξ_1, ..., ξ_r) be a subdivision of X with
σ(ξ) < δ'. Then

    F(X,ξ) = F(ξ_1) ∪ ... ∪ F(ξ_r),

    Σ(F,X,ξ) = (F(ξ_1) ⊗ ρσ(ξ_1)) ⊕ ... ⊕ (F(ξ_r) ⊗ ρσ(ξ_r))

are well-defined when the parameters A_1, ..., A_m satisfy

    σ(A_1) < δ, ..., σ(A_m) < δ.

Clearly Σ(F,X,ξ) ∈ ℐ. Since for every j = 2, ..., r,

    F(ξ_{j-1}) ∩ F(ξ_j) ⊃ F(ρα(ξ_j)) ≠ ∅,

it follows that F(X,ξ) ∈ ℐ.

By (6-2), there is, for every i = 1, ..., t, a positive number k_i such that
for any Y,Y' ∈ ℐ_{I_i},

    P(G(Y), G(Y')) ≤ k_i P(Y,Y'),
    P(F(Y), G(Y)) ≤ k_i(σ(A_1) + ... + σ(A_m))

hold where the parameters satisfy

    σ(A_1) < δ, ..., σ(A_m) < δ,

and k_i is independent of the parameters.

Let k = max{k_1, ..., k_t}.

Then for any Y,Y' ∈ ℐ_I with σ(Y) < δ' and σ(Y') < δ',

    P(F(Y), G(Y')) ≤ k(P(Y,Y') + σ(A_1) + ... + σ(A_m)).

In fact, there is a finite sequence

    Y_1 = Y, Y_2, ..., Y_s = Y'

in ℐ_I such that

    σ(Y_2) = ... = σ(Y_s) = σ(Y'),
    α(Y_2) < ... < α(Y_s) = α(Y')

and Y_{j-1} and Y_j are contained in the same I_i for some i. Hence

    k P(Y,Y') = k(P(Y_1,Y_2) + ... + P(Y_{s-1},Y_s))
              ≥ P(G(Y_1), G(Y_2)) + ... + P(G(Y_{s-1}), G(Y_s))
              ≥ P(G(Y), G(Y')).

Moreover

    k(σ(A_1) + ... + σ(A_m)) ≥ P(F(Y), G(Y)).

Our assertion thus follows.

Now  we  are  ready  to  prove 


Theorem 1. Let f be a rational function of domain D(f), let G be an
associated special arithmetic function of f and let I be an element of ℐ
contained in D(f). Then there are positive numbers δ, δ', k such that
whenever F is an associated arithmetic function of f with parameters
A_1, ..., A_m such that F ⊃ G and σ(A_1) < δ, ..., σ(A_m) < δ, X is an
element of ℐ_I and ξ = (ξ_1, ..., ξ_r) is a subdivision of X with
σ(ξ) < δ',

    F(X,ξ) = F(ξ_1) ∪ ... ∪ F(ξ_r)

is defined and satisfies

    f(X) ⊂ F(X,ξ) ⊂ f(X) ⊕ [-κ, κ],

where

    f(X) = {f(x) | x ∈ X},

    κ = k(σ(ξ) + σ(A_1) + ... + σ(A_m)).

Proof. Let δ, δ', k be chosen as above. Let F be an associated arithmetic
function of f with parameters A_1, ..., A_m such that F ⊃ G and
σ(A_1) < δ, ..., σ(A_m) < δ. Let X ∈ ℐ_I and let ξ be a subdivision of X
with σ(ξ) < δ'. We have shown that F(X,ξ) is defined.

For every y ∈ f(X) there is an x ∈ X with f(x) = y. Let x ∈ ξ_j. Then, by
(6-5),

    y = f(x) ∈ Fρ(x) ⊂ F(ξ_j) ⊂ F(X,ξ).

Hence f(X) ⊂ F(X,ξ).

For every y ∈ F(X,ξ), there is a ξ_j with y ∈ F(ξ_j). Let x ∈ ξ_j. Then
P(ξ_j, ρ(x)) ≤ σ(ξ_j) ≤ σ(ξ). It follows that

    P(F(ξ_j), Gρ(x)) ≤ k(σ(ξ) + σ(A_1) + ... + σ(A_m))

or

    P(F(ξ_j), ρf(x)) ≤ κ.

Therefore

    y ∈ F(ξ_j) ⊂ ρf(x) ⊕ [-κ, κ]
               ⊂ f(X) ⊕ [-κ, κ].

Hence F(X,ξ) ⊂ f(X) ⊕ [-κ, κ].

q.e.d.


Corollary 1. Let f be a rational function of domain D(f), let G be an
associated special arithmetic function of f and let X be an element of ℐ
contained in D(f). Let F be an associated arithmetic function of f with
parameters A_1, ..., A_m such that F ⊃ G. Then whenever ξ is a subdivision
of X with small σ(ξ) and σ(A_1), ..., σ(A_m) are small, F(X,ξ) is defined.
Moreover, as σ(ξ) + σ(A_1) + ... + σ(A_m) → 0,

    lim F(X,ξ) = f(X),

that means,

    lim P(F(X,ξ), f(X)) = 0.
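
The convergence asserted here can be watched on a small case. The sketch below is an illustration under assumptions, not the report's procedure: it takes f(x) = x² + x on X = [-1,0], whose true image is [-1/4, 0], uses the special arithmetic function F(X) = (X ⊗ X) ⊕ X (so all parameters have width 0), subdivides X into r equal pieces, and forms the union of the images. Exact rational arithmetic from the standard `fractions` module keeps rounding out of the picture.

```python
from fractions import Fraction

def add(A, B):
    """Interval addition."""
    return (A[0] + B[0], A[1] + B[1])

def mul(A, B):
    """Interval multiplication via the four endpoint products."""
    prods = [x * y for x in A for y in B]
    return (min(prods), max(prods))

def F(X):
    """Interval version of f(x) = x^2 + x."""
    return add(mul(X, X), X)

def subdivide(X, r):
    """Split X into r adjacent pieces of equal width."""
    a, b = Fraction(X[0]), Fraction(X[1])
    h = (b - a) / r
    return [(a + j * h, a + (j + 1) * h) for j in range(r)]

def union_of_images(X, r):
    """F(X, xi): union of the overlapping images F(xi_j), as one interval."""
    images = [F(piece) for piece in subdivide(X, r)]
    return (min(lo for lo, _ in images), max(hi for _, hi in images))

def P(A, B):
    """Metric on intervals."""
    return max(abs(A[0] - B[0]), abs(A[1] - B[1]))

X = (-1, 0)
true_range = (Fraction(-1, 4), Fraction(0))
for r in (1, 2, 4, 8, 16):
    enclosure = union_of_images(X, r)
    # Each enclosure contains the true range; the distance shrinks with r.
    print(r, enclosure, P(enclosure, true_range))
```

In this example the distance to the true image comes out as 1, 1/2, 1/4, ... for r = 1, 2, 4, ..., that is, proportional to the mesh σ(ξ), in line with the bound κ = kσ(ξ) of Theorem 1 when the parameters are degenerate.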


9.  SECOND  APPROXIMATION  THEOREM 

Theorem 2. Let $f$ be a rational function of domain $D(f)$, let $G$ be
an associated special arithmetic function of $f$ and let $I$ be an element
of $\mathcal{J}$ contained in $D(f)$. Then there are positive numbers $\delta, \delta', k$ such
that whenever $F$ is an associated arithmetic function of $f$ with parameters
$A_1, \ldots, A_m$ such that $F \supset G$ and $\sigma(A_1) < \delta, \ldots, \sigma(A_m) < \delta$, $X = [a,b]$
is an element of $\mathcal{J}_I$ and $\xi = \{\xi_1, \ldots, \xi_r\}$ is a subdivision of $X$,

    $\Sigma(F,X,\xi) = \bigl(F(\xi_1) \otimes p\sigma(\xi_1)\bigr) \oplus \cdots \oplus \bigl(F(\xi_r) \otimes p\sigma(\xi_r)\bigr)$

is defined and satisfies

    $p\bigl(\int_a^b f(x)\,dx\bigr) \subset \Sigma(F,X,\xi) \subset p\bigl(\int_a^b f(x)\,dx\bigr) \oplus [-\kappa\sigma(X), \kappa\sigma(X)]$,

where

    $\kappa = k\bigl(\sigma(\xi) + \sigma(A_1) + \cdots + \sigma(A_m)\bigr)$.


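Proof. Let $\delta, \delta', k$ be as before. Let $F$ be an associated arithmetic
function of $f$ with parameters $A_1, \ldots, A_m$ such that $F \supset G$ and
$\sigma(A_1) < \delta, \ldots, \sigma(A_m) < \delta$. Let $X = [a,b]$ be an element of $\mathcal{J}_I$ and
let $\xi = \{\xi_1, \ldots, \xi_r\}$ be a subdivision of $X$ with $\sigma(\xi) < \delta'$. We
have shown that $\Sigma(F,X,\xi)$ is defined.

Since $\xi_j$ is compact, there is a point $x_j$ of $\xi_j$ with $f(x_j) = m_j$.
Then

    $m_j = f(x_j) \in Fp(x_j) \subset F(\xi_j)$

so that

    $m_j\,\sigma(\xi_j) \in F(\xi_j) \otimes p\sigma(\xi_j)$.

Hence

    $\sum_j m_j\,\sigma(\xi_j) \in \Sigma(F,X,\xi)$.

Similarly we can show that

    $\sum_j M_j\,\sigma(\xi_j) \in \Sigma(F,X,\xi)$.

Since $\Sigma(F,X,\xi) \in \mathcal{J}$, it follows that

    $p\bigl(\int_a^b f(x)\,dx\bigr) \subset \Sigma(F,X,\xi)$.

Let $x_j$ be as above. Since

    $\rho(\xi_j, p(x_j)) \le \sigma(\xi_j) \le \sigma(\xi)$,

it follows that

    $\rho\bigl(F(\xi_j), p(m_j)\bigr) \le k\bigl(\sigma(\xi) + \sigma(A_1) + \cdots + \sigma(A_m)\bigr) = \kappa$

so that

    $F(\xi_j) \subset p(m_j) \oplus [-\kappa, \kappa]$.

Therefore

    $F(\xi_j) \otimes p\sigma(\xi_j) \subset p\bigl(m_j\,\sigma(\xi_j)\bigr) \oplus [-\kappa\sigma(\xi_j), \kappa\sigma(\xi_j)]$.

Hence

    $\Sigma(F,X,\xi) \subset p\bigl(\sum_j m_j\,\sigma(\xi_j)\bigr) \oplus [-\kappa\sigma(X), \kappa\sigma(X)]$.

Similarly we can prove that

    $\Sigma(F,X,\xi) \subset p\bigl(\sum_j M_j\,\sigma(\xi_j)\bigr) \oplus [-\kappa\sigma(X), \kappa\sigma(X)]$.

Since $\Sigma(F,X,\xi) \in \mathcal{J}$, it follows that

    $\Sigma(F,X,\xi) \subset p\bigl(\int_a^b f(x)\,dx\bigr) \oplus [-\kappa\sigma(X), \kappa\sigma(X)]$.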
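
The sum $\Sigma(F,X,\xi)$ of Theorem 2 is an interval version of a Riemann sum. A minimal sketch follows, under assumptions that are not in the report: f(x) = x^2 on X = [0, 1], uniform subdivisions, degenerate parameters, and exact rational endpoints. The names `mul` and `riemann_enclosure` are hypothetical.

```python
from fractions import Fraction

def mul(x, y):
    """Product of two closed intervals given as (lo, hi) pairs."""
    products = [x[0] * y[0], x[0] * y[1], x[1] * y[0], x[1] * y[1]]
    return (min(products), max(products))

def riemann_enclosure(a, b, r):
    """Sum over r equal pieces of F(piece) times the width of the piece.

    Here F(X) = X * X is the interval evaluation of f(x) = x^2.
    """
    step = (b - a) / r
    total_lo = total_hi = Fraction(0)
    for j in range(r):
        piece = (a + j * step, a + (j + 1) * step)
        f_lo, f_hi = mul(piece, piece)
        total_lo += f_lo * step      # the width is non-negative, so order is kept
        total_hi += f_hi * step
    return (total_lo, total_hi)

a, b = Fraction(0), Fraction(1)
integral = Fraction(1, 3)            # exact integral of x^2 over [0, 1]
for r in (1, 2, 4, 8):
    lo, hi = riemann_enclosure(a, b, r)
    print(r, lo, hi, hi - lo)
```

For this integrand the enclosure contains 1/3 for every r, and its width is (b - a)/r, so it is proportional to the fineness of the subdivision, in line with the bound $\kappa\sigma(X)$.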


Corollary. Let $f$ be a rational function of domain $D(f)$, let $G$ be
an associated special arithmetic function of $f$ and let $X = [a,b]$ be an
element of $\mathcal{J}$ contained in $D(f)$. Let $F$ be an associated arithmetic
function of $f$ with parameters $A_1, \ldots, A_m$ such that $F \supset G$. Then
whenever $\xi$ is a subdivision of $X$ with small $\sigma(\xi)$ and
$\sigma(A_1), \ldots, \sigma(A_m)$ are small, $\Sigma(F,X,\xi)$ is defined. Moreover,
$\Sigma(F,X,\xi)$ converges as

    $\sigma(\xi) + \sigma(A_1) + \cdots + \sigma(A_m) \to 0$,

that means

    $\lim \Sigma(F,X,\xi) = \int_a^b f(x)\,dx$.


10. APPROXIMATION OF A CONTINUOUS FUNCTION BY ARITHMETIC FUNCTIONS


Let $I$ be an element of $\mathcal{J}$ and let $f : I \to R$ be a continuous function.
Let

    $\{F_n\}$

be a sequence of arithmetic functions such that $p(I)$ is contained in the
domain $\mathcal{D}(F_n)$ for all $n$. If for every $x \in I$,

    $F_1 p(x) \supset F_2 p(x) \supset \cdots$

and

    $\bigcap_{n=1}^{\infty} F_n p(x) = pf(x)$,

we say that $\{F_n\}$ converges to $f$. In symbols,

    $\lim_{n \to \infty} F_n = f$.


(10-1) Let $f$ be a rational function of domain $D(f)$ and let $I$ be
an element of $\mathcal{J}$ contained in $D(f)$. Let $F$ be an associated arithmetic
function of $f$ with parameters $A_1, \ldots, A_m$. Then for every $\epsilon > 0$
there is a $\delta > 0$ such that if $\sigma(A_1) < \delta, \ldots, \sigma(A_m) < \delta$, then for
every $x \in I$, $Fp(x)$ is defined and $\sigma\bigl(Fp(x)\bigr) < \epsilon$.

Proof. Let $G$ be the associated special arithmetic function of $f$ with
$F \supset G$ and let $B_1, \ldots, B_m$ be the parameters of $G$. By (6-3) there
is, for every $x \in I$, a $\delta_x > 0$ such that if $\rho(A_1,B_1) < \delta_x, \ldots,$
$\rho(A_m,B_m) < \delta_x$ and $X$ is an element of $\mathcal{J}$ with $\rho(X,p(x)) < \delta_x$,
then $F(X)$ is defined and $\rho\bigl(F(X), Gp(x)\bigr) < \epsilon/2$. Since $I$ is compact
there exist a finite number of points of $I$, say $x_1, \ldots, x_t$, such
that the union of


"J  -  ^  ^x J  ^ 


j  =  1,  ,  .  , ,  t  , 


contains  I  .  Let 


B  =  min  *  •  •  >  * 

It  follows  that  if  P(Aj^,Bj^)  <  B  .  P(Ajjj,Bjji)  <  B  ,  then  for  every  xel  , 

a(^(x)^ 


<  e  . 


In  fact,  there  is  an  x.  with  P 


p(^(x),  pf(x)^  <  e/2  .  Hence 


(p(x)  ,  P(Xj)^ 


<  B.  so  that 
J 


f(^(x)) 


<  e  . 


q.e.d. 


Let $f$ be a rational function of domain $D(f)$ and let $I$ be an element
of $\mathcal{J}$ contained in $D(f)$. By (10-1) we can easily construct a sequence
of arithmetic functions

    $F_1 \supset F_2 \supset \cdots$


such  that  for  every  x€l 


n  =  1,  2 ,  . . . 


Hence

    $\lim_{n \to \infty} F_n = f$.

This result can be extended as follows.

(10-2) For every continuous function $f : I \to R$, $I \in \mathcal{J}$, there is
a sequence of arithmetic functions

    $F_n$, $n = 1, 2, \ldots$

with

    $\lim_{n \to \infty} F_n = f$.


Proof. It is well known that every continuous function can be approximated
by polynomials. For every integer $n \ge 1$ we let $f_n$ be a polynomial
such that for every $x \in I$, $\rho\bigl(pf_n(x), pf(x)\bigr) < 1/(6 \cdot 3^n)$. By (10-1),
there is an arithmetic function $G_n$ such that for every $x \in I$, $G_n p(x)$
is defined and

    $\rho\bigl(G_n p(x), pf_n(x)\bigr) < 1/(6 \cdot 3^n)$.

Let $F_n$ be the arithmetic function such that

    $F_n(X) = G_n(X) \oplus [-4/(6 \cdot 3^n), 4/(6 \cdot 3^n)]$, $X \in \mathcal{D}(G_n)$.

Then for every $x \in I$, $F_n p(x)$ is defined. Since

    $\rho\bigl(G_n p(x), pf(x)\bigr) \le \rho\bigl(G_n p(x), pf_n(x)\bigr) + \rho\bigl(pf_n(x), pf(x)\bigr)$
                          $< 1/(6 \cdot 3^n) + 1/(6 \cdot 3^n) = 2/(6 \cdot 3^n)$,

it follows that

    $pf(x) \subset G_n p(x) \oplus [-2/(6 \cdot 3^n), 2/(6 \cdot 3^n)]$,
    $G_n p(x) \subset pf(x) \oplus [-2/(6 \cdot 3^n), 2/(6 \cdot 3^n)]$.

Hence

    $pf(x) \oplus [-2/(6 \cdot 3^n), 2/(6 \cdot 3^n)] \subset F_n p(x) \subset pf(x) \oplus [-1/3^n, 1/3^n]$.

Consequently

    $F_{n+1} p(x) \subset pf(x) \oplus [-1/3^{n+1}, 1/3^{n+1}] \subset F_n p(x)$.

Since $\sigma\bigl(F_n p(x)\bigr) \le 2/3^n$ and $\lim_{n \to \infty} 2/3^n = 0$, it follows that
$\lim_{n \to \infty} F_n = f$.

q.e.d.


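Let $I$ be an element of $\mathcal{J}$ and let

    $\xi = \{\xi_1, \ldots, \xi_r\}$, $\eta = \{\eta_1, \ldots, \eta_s\}$

be subdivisions of $I$. If there exist integers $1 \le j(1) < j(2) < \cdots < j(r) = s$
such that for every $i = 1, \ldots, r$, $\{\eta_{j(i-1)+1}, \ldots, \eta_{j(i)}\}$ (with $j(0) = 0$)
is a subdivision of $\xi_i$, we write $\xi \prec \eta$ and call $\eta$ a refinement of $\xi$.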


Let $I \in \mathcal{J}$ and let $F$ be an arithmetic function with $\mathcal{D}(F) \supset p(I)$. As
before, there is a positive number $\delta$ such that whenever $X \in \mathcal{J}_I$ with
$\sigma(X) < \delta$, $F(X)$ is defined. Hence if $\xi$ is a subdivision of $I$ with
$\sigma(\xi) < \delta$, both $F(I,\xi)$ and $\Sigma(F,I,\xi)$ are defined.


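(10-3) Let $F$ be an arithmetic function and let $\xi = \{\xi_1, \ldots, \xi_r\}$
be a subdivision of $I \in \mathcal{J}$ such that $F(\xi_1), \ldots, F(\xi_r)$ are defined. Then
for every refinement $\eta = \{\eta_1, \ldots, \eta_s\}$ of $\xi$, $F(\eta_1), \ldots, F(\eta_s)$ are
defined so that $F(I,\eta)$ and $\Sigma(F,I,\eta)$ are defined. Moreover,

    $F(I,\eta) \subset F(I,\xi)$,

    $\Sigma(F,I,\eta) \subset \Sigma(F,I,\xi)$.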


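(10-4) Let $I \in \mathcal{J}$ and let $F$ be an arithmetic function. Let
$\xi = \{\xi_1, \ldots, \xi_r\}$ be a subdivision of $I$ such that $F(\xi_1), \ldots, F(\xi_r)$
are defined, and let

    $\xi = \xi^{(1)} \prec \xi^{(2)} \prec \cdots$

be a sequence of subdivisions of $I$. Then $F(I,\xi^{(n)})$ and $\Sigma(F,I,\xi^{(n)})$
are defined for all $n$ and

    $F(I,\xi^{(1)}) \supset F(I,\xi^{(2)}) \supset \cdots$,

    $\Sigma(F,I,\xi^{(1)}) \supset \Sigma(F,I,\xi^{(2)}) \supset \cdots$.

If $\lim_{n \to \infty} \sigma(\xi^{(n)}) = 0$, then $\bigcap_{n=1}^{\infty} F(I,\xi^{(n)})$ and $\bigcap_{n=1}^{\infty} \Sigma(F,I,\xi^{(n)})$
are independent of the choice of $\xi$ and $\{\xi^{(n)}\}$.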

Proof.  Let  q  =  ,  •••,  Tig]"  be  a  subdivision  of  I  such  that 

•••,  F(qg)  are  defined  and  let  be  a  refinement 

of  T)  .  By  definition,  there  are  integers  1^  j(l)  <  ...  <  j(s)  =  t 
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is  a  sub- 


such  that  for  every  i  =  1,  s, 

division  of  .  It  follows  from  (6-5)  that 

Hence 

F(I,0  =  F(?J  U  ...  U  P(^^) 

CF(ti^)  U  ...  U  F(Tig)  =  F(I,ti) 


As  a  consequence  of  this  result  we  have 

f(i,|^^^)-.3f(i,|^^^)=5  ... 

Since 

“  (|(?J(i-i)«)  ®  ^<5j(i-i)«))  ®  ••  •  ®  (J(5j(i))  ®  ^43(i)|) 

(f'li))  ■’(^jd-iHi)  •••  *  “(^(ii))  ’’(^od)) 

=  a  (f(ti^))  o(ti^)  =  a^F(Ti^)  ®  pa(Ti^)^ 


and  similarly 
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it  follows  that 


C  F(ti^)  ®  pa(Ti^)  . 


Hence 


2(F,I,0  =  ©  ...  ®(f(^^)  ®  pa(C^j) 

<=  (f(t|i)  ®  po(tii))  ©  ...  ©  (F(ng)  ®  pa(ng))  =  2:(F,i,n) 


From  this  result  it  follows  that 


z(f,I,|^^^^  3... 

Now  we  assiune  =  0  .  Let  be  a  sub 

division  of  I  such  that  F(n  ),  ...,  F(ij  )  are  defined,  and  let 

X  '6 

H  H  ^  -s  •  •  • 


lim 


be  a  sequence  of  subdivisions  with  rT-^w  =  0  • 

Let  fi  >  0  .  For  any  ye  there  is  a  sequence 


(1)  (2) 
i(i)^^i(a) 


in  J  such  that  for  every  n  =  1-  2,  ....  and 

i(n) 

Since  ^  f  ^n-i(^i^n^  contains  a  single  point  x  . 

(6-5)  there  is  a  6  >  0  such  that  whenever  XcJlj  with  P^X,p(x)^  <  i 
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F(X)  is  defined  and  p(f(X),  Fp(x))  <  £  . 
o(.<"0  <  5  holds  for  all  large  n  .  Let 
containing  x  .  Then  p(x))  <  6 

Hence  for  all  large  n 


Since 

,(n) 


lim 


n-^a> 

(n^  . 

^i 

so  that  P 


=  0 , 


(n) 


he  the  element  of  tj 

<  6 


y€Fp(x)C  ®  [-e,e]c  f(i,ti^''^)  e  [-£,&] 

This  proves  that  X  ^  f(i,t,^''^)  ©[-£,£]  .  Since  y  is  an  arbitrary 

point  of  U  ,  it  follows  that 

''n.i  ^(l.  «'“’)<=  l".i  ''(j’l'”*)  . 

Similarly, 

Cl  '■(j'l'”’) • 


Applying  (1-6),  we  have 

C=i  '  C.1  • 

In  order  to  prove  that  D*  Li  z(f,I,ti("))  ,  we  may  assume 

a(l)  >  0  .  It  is  sufficient  to  prove  that  for  every  integer  m  >  1  and 
every  £  >  0  , 

z;(f,i,?^“^^  ©[-£,e]  z)z(f,i,ti^''^) 


holds  for  large  n  .  In  fact,  if  this  is  proved,  then 

z(f,I,|^“^)  ©[-£,6]  z(p,I,Ti^''^)  . 

Since  t  is  arbitrary,  it  follows  that 
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Since  m  is  arbitrary,  it  follows  that 


Similarly, 


Cl  =>Ci  4-^’''^"') 

Cl  Cl 


Hence  our  assertion  follows. 


Now  we  let  m  be  an  arbitrary  integer  >  1  and  let  •••>  • 

Let  £,  be  an  arbitrary  positive  niomber  <  1  .  By  (6-3 )>  there  is,  for 
every  x€l  ,  a  positive  number  r^^  such  that  whenever  XeJ?  with  ^ 

p(x,p(xj)  <  r^  .  F(X)  is  defined  and  P^(X),  Fp(x)^<e/u^(p(l,|^“^^+^ 
Since  I  is  compact,  there  is  a  6  >  0  such  that  whenever  X€«iP^  with 
a(X)  <  5  ,  P^,p(x)^  <  rjj  for  some  x£l  . 

Since  n^^oB  o  «<■')  =  0  ,  there  is  an  integer  n^  such  that 


+  1 


for  all  integers  >  n-  .  Let  n  >  n  and  let 


T]^  ^  -  "I®!;  •••> 


For  every  i  =  1,  ...,  u  ,  there  is  a  largest  integer  j(i)  with 
P(A^)€Bj^^j  .  Clearly 

1  ^  J(l)  ^  J(2)  ^  ...  <  j(u)  =  V  . 
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Therefore 


e[-l,  l]CF(A^)  ©[-1,1] 


and  then 


and  consequently 


Since,  for  every  k  =  j(i-l)  +1,  •••,  j(i)  “  1^  A.  ,  it  follows 

K  X 

that 


C  F(A^)  0  per  (Aj^)  ,  i  =  1,  ...,  u  . 


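Hence, by adding these relations, we have

    $\Sigma(F,I,\eta^{(n)}) \subset \Sigma(F,I,\xi^{(m)}) \oplus [-\epsilon, \epsilon]$.

This completes our proof.

q.e.d.

Let $F$ be an arithmetic function with domain $\mathcal{D}(F)$. Let $\mathcal{J}_F$ be the
subset of $\mathcal{J}$ consisting of all the elements $X$ of $\mathcal{J}$ with $p(X) \subset \mathcal{D}(F)$.
Then, by (10-4), we may define a function

    $\bar{F} : \mathcal{J}_F \to \mathcal{J}$

by

    $\bar{F}(X) = \bigcap_{n=1}^{\infty} F(X,\xi^{(n)})$,

where $X \in \mathcal{J}_F$ and $\xi^{(1)} \prec \xi^{(2)} \prec \cdots$ is a sequence of subdivisions of $X$
with $\lim_{n \to \infty} \sigma(\xi^{(n)}) = 0$.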

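(10-5) Let $X$ and $Y$ be elements of $\mathcal{J}_F$ with $\beta(X) = \alpha(Y)$. Then
$X \cup Y \in \mathcal{J}_F$ and

    $\bar{F}(X \cup Y) = \bar{F}(X) \cup \bar{F}(Y)$.

Proof. Since $\beta(X) = \alpha(Y)$, $X \cup Y \in \mathcal{J}$. Since $X, Y \in \mathcal{J}_F$, $p(X) \subset \mathcal{D}(F)$ and
$p(Y) \subset \mathcal{D}(F)$. Hence $p(X \cup Y) \subset \mathcal{D}(F)$ and consequently $X \cup Y \in \mathcal{J}_F$.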

If  y€p(R)  ,  then  F(Y)  =  F(Y)CZF(X)  so  that  our  assertion  is  obvious. 
If  Y  4  p(R)  f  then  we  may  have  a  sequence  of  subdivisions 


I  -  ^  V(n); 

of  XwY  such  that  ...,  n^^oo  =  0  and  for  every 

Integer  n  there  is  an  Integer  s(n)  with  •ftSi)  =  p(X)  .  Therefore 

(n)  _  r  (n)  (n) 

n  -  ,  •••» 

is  a'  sequence  of  subdivisions  of  X  such  that  .<  ...  and 


Urn 


=  0  ;  and  •••>  a  sequence  of 
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Bubdlvlsions  of  Y  such  that 


. . . 


and 


Since 

f(x  wy ,  =  f(x  ,  K  F^ , 


it  follows  that 


F(X  wy)  =  FCx)-  F(Y)  . 


(10-6) $\bar{F} : \mathcal{J}_F \to \mathcal{J}$ is continuous.

Proof.  Let  YeJl^  and  let  £  >  0  .  Since  F(a(Y)) 
a  6  >  0  such  that  whenever  Z  ei  with  a(z)  <  6  ,  1 

P(Z)  is  defined  and  p(f(Z),  Fpa(y)^<  t/k  .  Since 

P(Z)  3F(Z)  r>Fpa(Y)=  FpaCy)  , 

we  have 

p(?(z),  Ppa(Y))  <  eA  . 

Similarly  there  is  a  6'  >  0  such  that  whenever  Z'€< 
F(z')  is  defined  and 

p^(z'),  Fpp(Y))  <  eA  . 

For  every  XeJfp  with  P(X,  Y)  <  min  (6,6')  we  have 
p(z,pa(Y))<6  ,  p(z',pp(y)) 

and  one  of  the  following  holds: 


(1) 

a(x)  =  a(z). 

p(z)  =  a(Y), 

0 

II 

(2) 

a(x)  =  a(z). 

p(x)  =  a(z'). 

p(z)  =  a(y). 

(5) 

a(z)  =  a(Y), 

p(z)  =  a(x), 

p(Y)  =  a(z'). 

(M 

a(z)  =  a(Y), 

p(z)  =  a(x). 

p(x)  =  a(z'), 



»(:<">)  .  0 . 


is  defined,  there  is 

(z,  pa(y))  <  &  , 


with  p(z',  pp(Z^<  6' 

Z,  Z'^J  such  that 
6' 

p(Z')  =  p(X)  ; 

p(z')  =  p(Y)  ; 

P(x)  =  p(z’)  ; 
p(z’)  =  p(Y)  . 
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In  case  (l)  we  have,  by  (10-5), 


F(X)  =  F(Z) w  F(Y) w  F(Z' ) 


Then  F(x)Z)F(Y)  .  Since 


F(Z)C  Fpa(Y)  @  [-£/2,  £/2]  CIF(Y)  @  [-£/2,  e/2]  , 


emd  similarly. 


F(Z'  )C  F(Y)  ©  [-e/2  ,  £/2]  , 


it  follows  that 


F(X)CI  F(Y)  ©[-£,£]  . 


Hence  p(f(x),  F(y))<  e  . 


In  case  (2),  we  have,  by  (10-5), 


F(X)  «  F(Z‘  )  =  F(Z)  wF(Y) 


Since  P{Z)C  F(Y)  @[-e/2,  V2]  and  F(Z’ )  C  F(Y)  ©  [- e/2  ,  e/2]  ,  it 
follows  that  F(X)  C  F(Y)  ©[-£,£].  On  the  other  hand, 

p(?(Z’)  ,  FpP(X))  <  p(V(Z'),  Fpp(Y))  @  P^pp(X),  Fpp(Y)) 


<  e/u  +  e/4  =  e/2 


so  that 


F(z' )  C  P(X)  @  [- £/2  ,  e/2]  . 

Hence  P(Y)CF(X)  +  [-  £,  &]  .  This  again  proves  that  P^(X),  F(Y)^<  C 
A similar argument shows that our assertion also holds for the other two
cases. q.e.d.

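(10-7) Let $I$ be an element of $\mathcal{J}$ and let $f : I \to R$ be a continuous
function. Let $\{F_n\}$ be a sequence of arithmetic functions with
$\lim_{n \to \infty} F_n = f$. Then

    $\bar{F}_1(I) \supset \bar{F}_2(I) \supset \cdots$

and $\bigcap_{n=1}^{\infty} \bar{F}_n(I) = f(I)$.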


Proof.  Let  n  be  an  integer  1  .  Let  £  >  0  .  For  every  x  eA 

there  is  an  >  0  such  that  whenever  X  g*?  with  p(x))  <  r^  , 

F^(X)  and  are  defined  and  p(f^^^(X)  ,  <  £.  .  Let 

6  be  a  positive  number  such  that  whenever  XeJj  with  a(x)  <  6  ,  we 


have  ,  p(x) 

Let  1  =r  <,'1 

1,  ^ 

1; 

every  i  -  1 ,  . 

Then 

■(ii.pCxj)) 


<  r. 


Xi 


so  that 


Hence 


fnd.f)  @[-6,e] 

r„+i(i,S)cF^(i,{)@[-e,c]  . 

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Since  G  is  arbitrary,  it  follows  that 

This  proves  that 

?^(I)  DF^d)  =>...  . 

Clearly  F^(l)  I5f(l)  for  all  n  .  Let  G  >  0  . 


For  every  n  =  1 , 2 ,  . . .  we  let 

I  I  Pf  (x))  >  ej  . 

It  follows  from  the  continuity  of  F  that  I  is  closed  in  I  .  Since 

n  n 

Ij,  3  Ig  ID...  and  =  ^  ,  we  infer  that  there  is  an  integer  n^ 

such  that  vdienever  n  >  *  P^*  (x)}  <  6  holds  for  all  xel  . 

Let  n  >  n^  .  Then  for  any  subdivision  |  =  ...,  of  I  we 

have 

Fn(li)c:  fdf)  @[-G,e]  ,  i  =  l,...,  r 

Hence 

?^(I)C  F^(l,|)Cf(l)  ©[-G,&]  . 

Since  G  is  arbitrary,  we  infer  that 

F^d)  =  f(l)  q.e.d. 


Theorem 3. Let $I \in \mathcal{J}$ and let $f : I \to R$ be a continuous function.
Let $\{F_n\}$ be a sequence of arithmetic functions with $\lim_{n \to \infty} F_n = f$.
Then there is a sequence $\xi^{(1)} \prec \xi^{(2)} \prec \cdots$ of subdivisions of $I$ such
that $\lim_{n \to \infty} \sigma(\xi^{(n)}) = 0$ and

    $\bigcap_{n=1}^{\infty} F_n(I,\xi^{(n)}) = f(I)$.

If, moreover, for every $x \in I$ and every integer $n \ge 1$, $F_{n+1} p(x)$ is
contained in the interior of $F_n p(x)$, then there is a sequence
$\xi^{(1)} \prec \xi^{(2)} \prec \cdots$ of subdivisions of $I$ such that

    $\lim_{n \to \infty} \sigma(\xi^{(n)}) = 0$ and $\lim_{n \to \infty} F_n(I,\xi^{(n)}) = f(I)$.


Proof.  By  ( 10-7 ) ,  we  have  F^ ( I )  3  Fg( I ) 

(i) 


and  F^(I)  =  f(l)  . 


Let  5 
P 


<  1  and 


be  a  subdivision  of  I  such  that  a  I 

<  1  .  Suppose  that  we  have  subdivisions 

and 


We  let  be  a  refine- 


F(I)^ 

^  . . .  ■<  of  I  such  that  l/n 

p(f(i,5^''^)  ,  F(I)^<  l/n  ,  n  =  1, 
ment  of  with  <  l/(k+l)  .  By  induction, 

we  have  a  sequence  ...  of  subdivisions  of  I  with 

,  ?„(!))  <  l/n  ■  Hence 

Cl  ''nG.?'”’)  =  '(I)  ■ 


If  for  every  xe  I  and  every  integer  n  1  ,  p(x)  is  contained  in 

the  interior  of  F^p(x)  ,  as  in  the  proof  of  (IO-7),  there  is  a  >  0 
such  that  whenever  |  is  a  subdivision  of  I  with  o(|)  <  5^  , 

F^  (  1,1)  3  F^_^^  (1,1),  n  =  l,2,...  .  Now  we  construct  a  sequence 
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|(i)  ^  of  subdivisions  of  I  ,  just  as  above,  satisfying  the 

addltioneO.  condition  that 


a(6^  <  6^  ,  n  -  1,  2,  ... 

Then  our  conclusion  follows.  q.e.d. 

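Let $F$ be an arithmetic function with domain $\mathcal{D}(F)$ and $\mathcal{J}_F$ be the
subset of $\mathcal{J}$ consisting of all the elements $I$ of $\mathcal{J}$ with $p(I) \subset \mathcal{D}(F)$.
Then, by (10-4), we may define a function

    $\Sigma_F : \mathcal{J}_F \to \mathcal{J}$

by

    $\Sigma_F(I) = \bigcap_{n=1}^{\infty} \Sigma(F,I,\xi^{(n)})$,

where $I \in \mathcal{J}_F$ and $\xi^{(1)} \prec \xi^{(2)} \prec \cdots$ is a sequence of subdivisions of $I$
with $\lim_{n \to \infty} \sigma(\xi^{(n)}) = 0$.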

As (10-5), (10-6) and (10-7), we have

(10-8) Let $I$ and $J$ be elements of $\mathcal{J}_F$ with $\beta(I) = \alpha(J)$. Then
$I \cup J \in \mathcal{J}_F$ and

    $\Sigma_F(I \cup J) = \Sigma_F(I) \oplus \Sigma_F(J)$.

(10-9) $\Sigma_F : \mathcal{J}_F \to \mathcal{J}$ is continuous.

(10-10) Let $I = [a,b] \in \mathcal{J}$ and let $f : I \to R$ be a continuous function.
Let $\{F_n\}$ be a sequence of arithmetic functions with $\lim_{n \to \infty} F_n = f$.
Then

    $\Sigma_{F_1}(I) \supset \Sigma_{F_2}(I) \supset \cdots$

and $\bigcap_{n=1}^{\infty} \Sigma_{F_n}(I) = p\bigl(\int_a^b f(x)\,dx\bigr)$.


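Making use of (10-10) and the definition of $\Sigma_F$ we can prove

Theorem 4. Let $I = [a,b] \in \mathcal{J}$ and let $f : I \to R$ be a continuous
function. Let $\{F_n\}$ be a sequence of arithmetic functions with
$\lim_{n \to \infty} F_n = f$. Then there is a sequence $\xi^{(1)} \prec \xi^{(2)} \prec \cdots$ of
subdivisions of $I$ such that $\lim_{n \to \infty} \sigma(\xi^{(n)}) = 0$ and

    $\bigcap_{n=1}^{\infty} \Sigma(F_n,I,\xi^{(n)}) = p\bigl(\int_a^b f(x)\,dx\bigr)$.

If, moreover, for every $x \in I$ and every integer $n \ge 1$, $F_{n+1} p(x)$ is
contained in the interior of $F_n p(x)$, then there is a sequence

    $\xi^{(1)} \prec \xi^{(2)} \prec \cdots$

of subdivisions of $I$ such that

    $\lim_{n \to \infty} \sigma(\xi^{(n)}) = 0$,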
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ABSTRACT 


The paper constitutes a study of the convergence of infinite products
of unitary matrices connected with Jacobi methods for computing
eigenvalues of Hermitian matrices. Bounds are obtained for the error in
approximate eigenvectors resulting from a finite number of steps of
three Jacobi processes, namely, the classical Jacobi method, quasicyclic
restricted Jacobi methods and threshold cyclic Jacobi methods.
The results apply only to Hermitian matrices which do not have
repeated eigenvalues. Also, certain questions are answered concerning
the representation of an arbitrary unitary matrix as an infinite
product.
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CONVERGENCE  OF  APPROXIMATE  EIGENVECTORS 
IN  JACOBI  METHODS 


1. INTRODUCTION. In this paper we are interested in studying the
convergence of infinite products of unitary matrices connected with Jacobi
methods [4] for computing eigenvalues of Hermitian matrices. We obtain
bounds for the error in approximate eigenvectors resulting from a finite
number of steps of a Jacobi process. Also, we are able to answer certain
questions concerning the representation of an arbitrary unitary
matrix as an infinite product. Use is made of the classical results of
Jacobi [5] and of the recent results of Henrici [4] and Pope and Tompkins
[7]. Basically, we adhere to the notation and terminology used in [4].

Consider the following computational algorithm. Let $A = A_0 = (a_{rc})$
be a Hermitian matrix of order $n$ with eigenvalues $\lambda_r$ ($r = 1, 2, \ldots, n$).
One calculates a sequence of matrices $A_1, A_2, \ldots, A_k = (a_{rc}^{(k)}), \ldots$
which are unitarily similar to $A$ by the recurrence relation

    $A_{k+1} = U_k^* A_k U_k$.    (1.1)

The $U_k = (u_{rc}^{(k)})$ are special unitary matrices of order $n$. For every
value of $k$ there is specified a pair $\pi_k = (i_k, j_k) = (i, j)$ of
indices (we omit the subscript $k$ in the sequel for notational
simplicity) satisfying $1 \le i < j \le n$, such that the 2 x 2 matrix

    $V_k = \begin{pmatrix} u_{ii}^{(k)} & u_{ij}^{(k)} \\ u_{ji}^{(k)} & u_{jj}^{(k)} \end{pmatrix}$,    (1.2)

which is a principal submatrix of $U_k$, is unitary. All other elements
of $U_k$ satisfy

    $u_{rc}^{(k)} = \delta_{rc}$.    (1.3)

The matrices $U_k$ are completely determined by the pairs $\pi_k$ and the
2 x 2 unitary matrices $V_k$.

* Bracketed numbers indicate References, page 21.


Any set of rules for choosing the $\pi_k$, $V_k$ is called a Jacobi method.

A Jacobi method is said to be convergent if

    $A_k \to \Lambda$ ($k \to \infty$),    (1.4)

for all $A$ under the adopted set of rules, where $\Lambda$ is a diagonal matrix
whose diagonal elements are the $\lambda_r$ in some order. Either of the
quantities

    $S_k = \bigl( \sum_{r \ne c} |a_{rc}^{(k)}|^2 \bigr)^{1/2}$    (1.5)

or

    $M_k = \max_{r \ne c} |a_{rc}^{(k)}|$    (1.6)

may be used as a measure of the closeness of $A_k$ to $\Lambda$. For the
Jacobi methods to be considered in this paper, a necessary and sufficient
condition for convergence is that $S_k \to 0$ ($k \to \infty$) or,
equivalently, $M_k \to 0$ ($k \to \infty$). See either [2] or [4] for the proof. If
the infinite product of matrices

    $U = \prod_{k=0}^{\infty} U_k = U_0 U_1 U_2 \cdots$    (1.7)

converges, then

    $U^* A U = \Lambda$,    (1.8)

and the columns of $U$ are a complete set of normalized eigenvectors of
$A$. We shall investigate the convergence of (1.7) for three convergent
Jacobi methods: (a) the classical Jacobi method, (b) the quasicyclic
restricted Jacobi methods [4] and (c) the threshold cyclic Jacobi method
[7]. Throughout the paper we shall restrict $V_k$ to be of the form

    $V_k = \begin{pmatrix} \cos\theta_k & -e^{i\varphi_k}\sin\theta_k \\ e^{-i\varphi_k}\sin\theta_k & \cos\theta_k \end{pmatrix}$.    (1.9)


By the classical Jacobi method we shall mean a Jacobi method with the
following set of rules for determining the $U_k$ of (1.1). Choose $\pi_k = (i, j)$
such that

    $|a_{ij}^{(k)}| = \max_{r \ne c} |a_{rc}^{(k)}|$    (1.10)

and choose the $V_k$ of (1.9) such that

    $a_{ij}^{(k+1)} = 0$.    (1.11)

The  equation  (l.ll)  will  always  be  satisfied  if  the  relations 


<’k  =  arg  a^y  , 


tan  20'!^  - 


2  lajj)! 

-  at.V 

li  OJ 


(0k  =  0^^  ) 


(1.12) 


(1.13) 
hold simultaneously. Since Jacobi dealt only with real symmetric matrices
$A$ and orthogonal matrices $U_k$, the set of rules (1.10)-(1.13)
constitutes a generalization of his original method.
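
The classical rules can be tried out numerically. The following is a minimal sketch, not the authors' program: it assumes the rotation block $V_k$ has first row $(\cos\theta, -e^{i\varphi}\sin\theta)$ and second row $(e^{-i\varphi}\sin\theta, \cos\theta)$, that $\varphi = \arg a_{ij}$, that $\tan 2\theta = 2|a_{ij}|/(a_{ii} - a_{jj})$ with $\theta$ taken in $[-\pi/4, \pi/4]$, and that the update is $A \leftarrow U^* A U$. The function names and the 3 x 3 test matrix are hypothetical.

```python
import cmath
import math

def conj_transpose(m):
    n = len(m)
    return [[m[c][r].conjugate() for c in range(n)] for r in range(n)]

def matmul(x, y):
    n = len(x)
    return [[sum(x[r][k] * y[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]

def rotation(n, i, j, theta, phi):
    """Identity matrix except for the 2x2 rotation block in rows/columns i, j."""
    u = [[complex(r == c) for c in range(n)] for r in range(n)]
    u[i][i] = u[j][j] = complex(math.cos(theta))
    u[i][j] = -cmath.exp(1j * phi) * math.sin(theta)
    u[j][i] = cmath.exp(-1j * phi) * math.sin(theta)
    return u

def classical_jacobi(a, steps):
    """Run `steps` rotations; return the final matrix and the product of rotations."""
    n = len(a)
    u_total = [[complex(r == c) for c in range(n)] for r in range(n)]
    for _ in range(steps):
        # Pivot on the off-diagonal element of largest modulus.
        i, j = max(((r, c) for r in range(n) for c in range(r + 1, n)),
                   key=lambda rc: abs(a[rc[0]][rc[1]]))
        if abs(a[i][j]) == 0.0:
            break
        phi = cmath.phase(a[i][j])
        diff = (a[i][i] - a[j][j]).real
        # Angle in [-pi/4, pi/4] with tan(2 theta) = 2|a_ij| / (a_ii - a_jj).
        theta = math.pi / 4 if diff == 0 else 0.5 * math.atan(2 * abs(a[i][j]) / diff)
        u = rotation(n, i, j, theta, phi)
        a = matmul(conj_transpose(u), matmul(a, u))
        u_total = matmul(u_total, u)
    return a, u_total

A = [[2 + 0j, 1 + 1j, 0j],
     [1 - 1j, 3 + 0j, 1j],
     [0j, -1j, 1 + 0j]]
final, U = classical_jacobi(A, 200)
print([round(final[r][r].real, 10) for r in range(3)])   # approximate eigenvalues
```

The columns of the accumulated product `U` approximate eigenvectors of `A`; how fast that product settles down is the question the paper studies.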

A Jacobi method is called cyclic if in every segment of
$N = n(n - 1)/2$ consecutive elements of the sequence $\{\pi_k\}$ every pair
$(p, q)$ ($1 \le p < q \le n$) occurs exactly once. This is a special case of
a quasicyclic Jacobi method with period $K$ where the following condition
is imposed. In every segment of $K \ge N$ consecutive elements of the
sequence $\{\pi_k\}$ every pair $(p, q)$ ($1 \le p < q \le n$) occurs at least once.
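
The two conditions on the pivot sequence are easy to check mechanically. The sketch below assumes one particular cyclic ordering (row by row), which the text does not prescribe; the names `row_cyclic_pairs`, `pivot_sequence` and `is_quasicyclic` are hypothetical.

```python
from itertools import combinations

def row_cyclic_pairs(n):
    """All pairs (p, q) with 1 <= p < q <= n, taken row by row."""
    return list(combinations(range(1, n + 1), 2))

def pivot_sequence(n, sweeps):
    """A pivot sequence obtained by repeating the row-by-row order."""
    return row_cyclic_pairs(n) * sweeps

def is_quasicyclic(sequence, n, K):
    """True if every window of K consecutive pairs contains every pair at least once."""
    wanted = set(row_cyclic_pairs(n))
    return all(wanted <= set(sequence[s:s + K])
               for s in range(len(sequence) - K + 1))

n = 4
N = n * (n - 1) // 2
seq = pivot_sequence(n, sweeps=3)
print(N, seq[:N])
print(is_quasicyclic(seq, n, N))       # each window of length N holds every pair
print(is_quasicyclic(seq, n, N - 1))   # a shorter window must miss a pair
```

A window of exactly N pairs that contains all N distinct pairs contains each of them once, so the first check also confirms the cyclic property for this ordering.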

We next define a restricted Jacobi method. Let $\theta_k^*$ be an angle
satisfying (1.13) and belonging to the closed interval $[-\pi/4, \pi/4]$, and let
$\psi > 0$ be a constant angle not depending on $k$. Then any method in which
$\varphi_k$ is chosen to satisfy (1.12) and $\theta_k$ is chosen such that

    $\mathrm{sign}\,\theta_k = \mathrm{sign}\,\theta_k^*$,  $|\theta_k| = \min(|\theta_k^*|, \psi)$    (1.14)

is called a restricted Jacobi method with bound $\psi$. In [4] it was
proved that any quasicyclic, restricted Jacobi method with suitable
bound converges. Furthermore it is shown in [4] that, if $A$ has $n$
distinct eigenvalues, any quasicyclic Jacobi method in which $\theta_k$, $\varphi_k$
satisfy (1.12), (1.13) and $\theta_k \in [-\pi/4, \pi/4]$ converges quadratically
provided the off-diagonal elements are already sufficiently small. More
precisely,  if  4njij5^  ^  ^  then* 

*  Mr.  E.  R.  Hansen  has  shown  (oral  communication)  that  the  exponent  in 
the  exponential  function  may  be  replaced  by  /§nKd“^p 

k 
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(1.15) 


1.33^nK® 


S  2®  n(K  -  l)e 


(i“iii2 


^^k  » 


where 


d  =  Min  |X  -  X.|  . 

r  ^  c  ^ 

The threshold cyclic Jacobi method is a modification of the cyclic
Jacobi method due to Pope and Tompkins [7]. It depends on the choice of
a sequence of positive threshold values $t_\nu$ ($\nu = 0, 1, 2, \ldots$) such that
$t_{\nu+1} < t_\nu$ for all $\nu$. A fixed cyclic ordering is adopted for
the $\pi_k$, $\varphi_k$ always satisfies (1.12), and for each $\nu$, one chooses

    $\theta_k = 0$ for $|a_{ij}^{(k)}| < t_\nu$,  $\theta_k = \theta_k^*$ for $|a_{ij}^{(k)}| \ge t_\nu$.    (1.16)

When all $|a_{rc}^{(k)}| < t_\nu$ ($r \ne c$), $t_\nu$ is replaced by $t_{\nu+1}$. Pope and
Tompkins prove that the threshold method converges if $\lim_{\nu \to \infty} t_\nu = 0$. In
the present paper we shall take the sequence $\{t_\nu\}$ to be a geometric
progression; i.e., we shall assume

t,  =  oq^'  (0  <  q  <  1) 

V’ 


(1.17) 


2. NORMS OF MATRICES. In this section we summarize for convenience of
reference certain known results concerning norms of matrices. Let
$H = (h_{rc})$ and $G$ denote square matrices of order $n$. The norm (cf.
[1]) of a square matrix $H$ is defined to be a non-negative number
$N(H)$ satisfying the conditions

    $N(H) > 0$ for $H \ne 0$,    (2.1)

    $N(cH) = |c| \cdot N(H)$ for all scalars $c$,    (2.2)

    $N(H + G) \le N(H) + N(G)$,    (2.3)

    $N(HG) \le N(H) \cdot N(G)$.    (2.4)


This definition is equivalent to that given by Ostrowski [6] for his
"multiplicative norm." By (2.4)

    $N(H) = N(HI) \le N(H) \cdot N(I)$,

hence

    $N(I) \ge 1$    (2.5)

for any norm. There exist norms for which $N(I) = 1$, an example being
the familiar spectral norm

    $N_1(H) = \max_{x \ne 0} \bigl( x^* H^* H x / x^* x \bigr)^{1/2}$.    (2.6)


Note that for any unitary matrix $U$, $N_1(U) = 1$. The Euclidean norm is
defined by the relation

    $N_2(H) = \bigl( \sum_{r,c=1}^{n} |h_{rc}|^2 \bigr)^{1/2}$.    (2.7)

It is well known that, for all $H$,

    $n^{-1/2} \le N_1(H) / N_2(H) \le 1$.    (2.8)

Inequalities such as (2.8) hold quite generally. In fact, it was shown
in [6] that if $N(H)$ and $N'(H)$ are any two matrix norms, then

    $a(N,N') \le N(H) / N'(H) \le b(N,N')$    (2.9)

where $a(N,N')$ and $b(N,N')$ are positive constants which do not depend
on $H$. It is easy to show that

    $\max_{r,c} |h_{rc}| \le N_1(H)$,    (2.10)

and by (2.9) there exists a positive constant $b(N_1,N)$, independent of
$H$, so that

    $|h_{rc}| \le b(N_1,N) \cdot N(H)$    (2.11)

holds for all $r$ and $c$ and for any norm.

3. PRELIMINARY THEOREM ON INFINITE PRODUCTS. A sequence of $n \times n$
matrices $B_1, B_2, \ldots, B_m, \ldots$ with $B_m = (b_{rc}^{(m)})$ is said to be
convergent if a limit exists for each element of $B_m$. That is, if
$\lim_{m \to \infty} b_{rc}^{(m)} = b_{rc}$, then the matrix $B = (b_{rc})$ is said to be the limit of
the sequence $\{B_m\}$. If the sequence is not convergent, it is said
to be divergent. An infinite product

    $(I + E_1)(I + E_2)(I + E_3) \cdots$    (3.1)

of $n \times n$ matrices is here called convergent* if the partial product

    $P_m = \prod_{k=1}^{m} (I + E_k)$    (3.2)

converges to a limit. We now prove the following

THEOREM 1. Let $N(H)$ denote some norm of the matrix $H$. Let the
factors of the infinite product (3.1) be $n \times n$ unitary matrices, and
let the series $\sum N(E_k)$ converge. Then (3.1) converges to a unitary
matrix $U$. Furthermore, for all $m$

    $N(U - P_m) \le c(N) \sum_{k=m+1}^{\infty} N(E_k)$,    (3.3)

where $c(N) > 0$ depends only on $N$ and $n$.

* In defining convergence of infinite products of matrices in general, it
is necessary to exclude nonsingular factors in the product and also to
distinguish between singular and nonsingular limits. However, since we
are dealing only with unitary matrices in the present paper, it is
unnecessary to complicate the discussion with such distinctions.
Proof: We first establish that if the sequence $\{P_m\}$ has a limit $U$,
then $U$ must be a unitary matrix. Let $P_m = U + Q_m$. We have that
$Q_m \to 0$ ($m \to \infty$). Since each factor $I + E_k$ is unitary, $P_m$ is unitary
for all $m$. Thus

    $P_m^* P_m = I = (U^* + Q_m^*)(U + Q_m)$

or

    $I - U^*U = U^*Q_m + Q_m^*U + Q_m^*Q_m$.

The right hand side of the last expression tends to the null matrix as
$m \to \infty$, and this shows that $U$ is unitary. Let $r > m$ and note that

    $P_r - P_m = P_m\bigl[(I + E_{m+1}) \cdots (I + E_r) - I\bigr]$.    (3.4)

We shall prove that $P_m$ has a limit with the help of the following

LEMMA. If $H_1, H_2, \ldots, H_p$ and $G_1, G_2, \ldots, G_p$ are any two sets of
$n \times n$ matrices, then the following identity holds

    $H_1 H_2 \cdots H_p - G_1 G_2 \cdots G_p = (H_1 - G_1) G_2 \cdots G_p$    (3.5)
        $+ H_1 (H_2 - G_2) G_3 \cdots G_p + \cdots + H_1 \cdots H_{p-1} (H_p - G_p)$.

The truth of (3.5) is easily established by induction on $p$. The
details are left to the reader. Letting $H_k = I + E_{m+k}$, $G_k = I$ and
combining (3.4) and (3.5) we have

    $P_r - P_m = P_m\bigl[E_{m+1} + (I + E_{m+1}) E_{m+2}$    (3.6)
        $+ (I + E_{m+1})(I + E_{m+2}) E_{m+3} + \cdots + (I + E_{m+1}) \cdots (I + E_{r-1}) E_r\bigr]$.

Products of the matrices $(I + E_k)$ are unitary, hence by (2.4), (3.6)

    $N_1(P_r - P_m) \le N_1(E_{m+1}) + \cdots + N_1(E_r)$,    (3.7)

since the spectral norm of a unitary matrix is one. Now for an arbitrary
norm $N$, there exist* $a(N) > 0$ and $b(N) > 0$ such that

    $a(N) \cdot N(H) \le N_1(H) \le b(N) \cdot N(H)$    (3.8)

holds for all $H$. We have therefore that

    $a(N) \cdot N(P_r - P_m) \le N_1(P_r - P_m)$,
    $\sum_{k=m+1}^{r} N_1(E_k) \le b(N) \sum_{k=m+1}^{r} N(E_k)$.    (3.9)

Letting $c(N) = b(N)/a(N)$ and combining (3.7)-(3.9), we get

    $N(P_r - P_m) \le c(N) \sum_{k=m+1}^{r} N(E_k)$.    (3.10)

By hypothesis the series $\sum N(E_k)$ is convergent. Therefore, given any
$\epsilon > 0$, there exists an integer $\mu > 0$ such that

    $\sum_{k=\mu+1}^{\infty} N(E_k) < \epsilon / \bigl(b(N) \cdot c(N)\bigr)$.    (3.11)

We see from (3.10) and (3.11) that

    $N(P_{m+p} - P_m) < \epsilon / b(N)$    (3.12)

holds for all $m \ge \mu$ and for $p = 1, 2, \ldots$; but (3.12) is precisely
the condition that

    $|(P_{m+p} - P_m)_{rc}| < \epsilon$    (3.13)

holds for all $r$ and $c$ by (2.11). Hence $P_m$ tends to a limit $U$ as
$m \to \infty$. Finally, letting $r \to \infty$ in (3.10) we obtain (3.3); this
proves Theorem 1.

* The spectral norm is fixed; hence, the $a(N_1,N)$ and $b(N_1,N)$
of (2.9) do not depend on it. Therefore, for simplicity, we have here
written $a(N)$ for $a(N_1,N)$ and $b(N)$ for $b(N_1,N)$.
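
The bound (3.3) can be checked numerically in a simple case. The sketch below is an illustration under assumptions not taken from the paper: real 2 x 2 rotations with angles $\theta_k = 2^{-k}$, the Euclidean norm $N_2$, and the constant $c(N_2) = \sqrt{n}$ that results from reading (2.8) as $a = n^{-1/2}$, $b = 1$. A long finite product stands in for the limit $U$, and the helper names are hypothetical.

```python
import math

def rot(theta):
    """2x2 real rotation, a (real) unitary matrix."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]

def matmul(x, y):
    return [[sum(x[r][k] * y[k][c] for k in range(2)) for c in range(2)]
            for r in range(2)]

def diff(x, y):
    return [[x[r][c] - y[r][c] for c in range(2)] for r in range(2)]

def euclid_norm(m):
    """Square root of the sum of squared moduli of the elements."""
    return math.sqrt(sum(abs(v) ** 2 for row in m for v in row))

IDENT = [[1.0, 0.0], [0.0, 1.0]]
angles = [0.5 ** k for k in range(1, 60)]          # theta_k = 2^(-k), an assumed example
factors = [rot(t) for t in angles]                 # the factors I + E_k
norms_E = [euclid_norm(diff(f, IDENT)) for f in factors]

partial = [IDENT]                                  # partial[m] is P_m, with P_0 = I
for f in factors:
    partial.append(matmul(partial[-1], f))
U = partial[-1]                                    # stands in for the limit

c = math.sqrt(2.0)                                 # assumed c(N_2) = sqrt(n), n = 2
for m in (0, 1, 5, 10):
    lhs = euclid_norm(diff(U, partial[m]))
    rhs = c * sum(norms_E[m:])                     # sum over k = m+1, m+2, ...
    print(m, lhs, rhs, lhs <= rhs)
```

Because the norms $N_2(E_k)$ decay geometrically here, the right-hand side of the bound decays geometrically in $m$ as well.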
4. CONVERGENCE OF EIGENVECTORS AND ERROR ESTIMATES. We first give a
rather general result about convergence and estimation of the error in
all the eigenvectors. Following this we give estimates for the three
Jacobi methods considered in this paper.

THEOREM 2. Let the $n \times n$ Hermitian matrix $A$ have $n$ distinct
eigenvalues, and let the sequence of matrices (1.1) be generated by a
Jacobi method such that the angles $\theta_k$ satisfy (1.13), every
$\theta_k \in [-\pi/4, \pi/4]$, and such that the series


00 


koO 


converges. Then the infinite product (1.7) converges. Furthermore,
for all $m > m_0$ we have

where 

$$d = \min_{r \ne c} |\lambda_r - \lambda_c|, \tag{4.2}$$

and where $m_0$ is the smallest integer such that for all $m > m_0$


m 


(^.3) 


Proof: We shall make use of the fact that, for some ordering of the
eigenvalues $\lambda_r$ of $A$,

\\  -  ^  (r  -  1,  2,  n)  (4.4) 

hold for all $k$. The inequalities (4.4) follow from Theorem 1 of [4].

Note that, while (4.4) is valid for any value of $k$, the ordering of the
eigenvalues $\lambda_r$ is generally not the same for all $k$. However, our use
of (4.4) does not depend on the ordering of the $\lambda_r$.


Let the $U_k$ of (1.1) be expressed in the form

$$U_k = I + E_k, \tag{4.5}$$

and let $E_k = (e_{rc}^{(k)})$. From (1.3) and (1.9) we see that the $2 \times 2$
principal submatrix of $E_k$ corresponding to $V_k$ is given by


.(k)  Jk)' 

-il  ®ij 

J1  JJ 


COS  6j^  -  1 
e  sin  Qr 


-e  sin  01 


cos  0j^  -  1 


(^.6) 


and all other elements of $E_k$ are zero. From the definition (2.7) we
get

$$N_2(E_k) = \bigl[2(\cos\theta_k - 1)^2 + 2\sin^2\theta_k\bigr]^{1/2},$$

and with the help of some standard trigonometric identities, we obtain

$$N_2(E_k) = 8^{1/2}\,\Bigl|\sin\frac{\theta_k}{2}\Bigr|.$$
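The closed form for $N_2(E_k)$ is easy to confirm numerically. The following is a minimal sketch, assuming the $2 \times 2$ block has diagonal entries $\cos\theta_k - 1$ and off-diagonal entries of modulus $|\sin\theta_k|$ with a unimodular phase factor; the exact placement of the phase is an assumption here, and the function names are hypothetical.

```python
import cmath
import math

def n2_of_block(theta, phase):
    """Euclidean norm of the assumed 2x2 block of E_k = U_k - I."""
    c, s = math.cos(theta), math.sin(theta)
    w = cmath.exp(1j * phase)  # unimodular phase factor (assumed placement)
    block = [[c - 1, -w * s],
             [w.conjugate() * s, c - 1]]
    return math.sqrt(sum(abs(x) ** 2 for row in block for x in row))

def closed_form(theta):
    """8^(1/2) * |sin(theta / 2)|, the value obtained in the text."""
    return math.sqrt(8) * abs(math.sin(theta / 2))

samples = [(t, p) for t in (-0.7, -0.1, 0.0, 0.3, math.pi / 4)
           for p in (0.0, 1.1, 2.5)]
max_error = max(abs(n2_of_block(t, p) - closed_form(t)) for t, p in samples)
```

The phase drops out because only the moduli of the entries enter the Euclidean norm, so `max_error` is at rounding level.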

By the definitions (1.5) and (1.6)


hk  s  7f  Sk 


Hence,  by  (4.3)  we  have 


^  1  d  ^  d  /  ^  V 

^m^VfTT^ir  (n»>m^). 

Using  (1.6),  (1.15)  and  (4.2) — (4.4)  it  follows  that 

o  I 

'  (k)7<— “T 


(4.7) 

(4.8) 

(4.9) 

(4.10) 


a  '  -  a 
ii  JJ 
holds for $k > m_0$. Hence, by (4.9), $|\tan 2\theta_k| < 1$ and
$|2\theta_k| < \pi/4$ $(k > m_0)$, since $\theta_k \in [-\pi/4, \pi/4]$. For these
small angles it is easily verified that
$4\,|\tan(\theta_k/2)| \le |\tan 2\theta_k|$. Hence for $k > m_0$,
we have from (4.7), (4.8) and (4.10)


(4.11) 


ly» 
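The small-angle inequality $4|\tan(\theta/2)| \le |\tan 2\theta|$ used in this step follows from the convexity of the tangent on $[0, \pi/2)$; a quick numerical scan over the range $|2\theta| < \pi/4$ is sketched below. The grid and the variable names are illustrative choices, not part of the original argument.

```python
import math

def slack(theta):
    """|tan(2 theta)| - 4 |tan(theta / 2)|; non-negative if the inequality holds."""
    return abs(math.tan(2 * theta)) - 4 * abs(math.tan(theta / 2))

# Grid over the open interval |theta| < pi/8, i.e. |2 theta| < pi/4.
thetas = [k * (math.pi / 8) / 1000 for k in range(-999, 1000)]
worst = min(slack(t) for t in thetas)
```

The minimum slack is attained at $\theta = 0$, where both sides vanish.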

By hypothesis the series converges. Therefore $\sum N_2(E_k)$ converges
and by Theorem 1 the infinite product (1.7) converges. Now applying
(3.3) to the case where $N = N_2$, we see from (2.8) and (2.9) that


$$c(N_2) = \frac{b(N_1, N_2)}{a(N_1, N_2)} = n^{1/2}. \tag{4.12}$$

Combining (3.3), (4.11) and (4.12) we obtain (4.1) and this completes
the proof of Theorem 2.


As before we let $N$ denote the number of elements on one side of
the diagonal of an $n \times n$ matrix, viz.

$$N = \frac{n(n - 1)}{2}, \tag{4.13}$$

and for convenience we define

$$h = (1 - N^{-1})^{1/2}. \tag{4.14}$$

We let $[x]$ denote the integral part of $x$, i.e., the largest integer
not exceeding $x$. Finally we let $\sigma = [N/q^2]$ and we state our main
result as follows.


THEOREM 3. Let the $n \times n$ Hermitian matrix $A$ have $n$ distinct
eigenvalues, and let the sequence of matrices (1.1) be generated by
either Method (a): the classical Jacobi method, Method (b): a quasicyclic
restricted Jacobi method, or Method (c): a threshold cyclic Jacobi method
in which the thresholds $t_\nu$ satisfy (1.17). Then the infinite product
of matrices (1.7) converges. Furthermore, for Method (a)

P 

d  ^  “o>  .  (4.15) 

for  Method  (b) 

3_ 

where $p$ is given by (4.23) below, and for Method (c)

‘  ^  (="“0)'''  1^  <”  >  ”0)  '  ('*•17) 

where in each case the integer $m_0$ has the same meaning as it had in
Theorem 2.

Proof: It is easy to show that, for any step in a Jacobi method in which
the parameters $\theta_k$ satisfy (1.13) and (1.15), the following equation
holds

$$S_{k+1} = S_k - 2\,|a_{ij}^{(k)}|^2. \tag{4.18}$$

In the classical Jacobi method we have

$$2N\,|a_{ij}^{(k)}|^2 \ge S_k \tag{4.19}$$

always satisfied. Hence

$$S_{k+1} \le (1 - N^{-1})\,S_k = h^2 S_k \quad (k = 0, 1, 2, \ldots) \tag{4.20}$$

holds for Method (a). It follows therefore that
$$\sum_{k=0}^{\infty} s_k \le s_0\,(1 + h + h^2 + \cdots) = \frac{s_0}{1 - h}, \tag{4.21}$$

hence by Theorem 2 the product (1.7) converges. By (4.20)

_  m-i 


k=m  ^  ° 

holds for all $m$. For $m > m_0$ we may combine the last expression with
(4.1) and obtain the inequality (4.15).
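The geometric decrease used for Method (a) can be observed on a small example. The sketch below is an illustration only, under these assumptions: the matrix is real symmetric (a special case of Hermitian), $S$ is taken as the sum of squares of all off-diagonal elements, and the rotation angle is chosen in $[-\pi/4, \pi/4]$ so that the pivot element is annihilated. The test matrix and all function names are hypothetical.

```python
import math

def off_diagonal_sum(a):
    """S: sum of squares of all off-diagonal elements of a."""
    n = len(a)
    return sum(a[r][c] ** 2 for r in range(n) for c in range(n) if r != c)

def classical_jacobi_step(a):
    """Annihilate the off-diagonal element of largest modulus; return R^T a R."""
    n = len(a)
    i, j = max(((r, c) for r in range(n) for c in range(r + 1, n)),
               key=lambda rc: abs(a[rc[0]][rc[1]]))
    if a[i][j] == 0.0:
        return a  # already diagonal
    gap = a[i][i] - a[j][j]
    if gap == 0.0:
        theta = math.copysign(math.pi / 4, a[i][j])
    else:
        theta = 0.5 * math.atan(2.0 * a[i][j] / gap)  # |theta| < pi/4
    cs, sn = math.cos(theta), math.sin(theta)
    rot = [[1.0 if r == q else 0.0 for q in range(n)] for r in range(n)]
    rot[i][i], rot[i][j], rot[j][i], rot[j][j] = cs, -sn, sn, cs
    ar = [[sum(a[r][k] * rot[k][q] for k in range(n)) for q in range(n)]
          for r in range(n)]
    return [[sum(rot[k][r] * ar[k][q] for k in range(n)) for q in range(n)]
            for r in range(n)]

n = 4
matrix = [[1.0 / (r + c + 1) + (r + 1.0 if r == c else 0.0) for c in range(n)]
          for r in range(n)]
big_n = n * (n - 1) // 2          # N of (4.13)
h_squared = 1.0 - 1.0 / big_n     # h^2 of (4.14)

history = [off_diagonal_sum(matrix)]
for _ in range(15):
    matrix = classical_jacobi_step(matrix)
    history.append(off_diagonal_sum(matrix))

# Each step should satisfy S_{k+1} <= h^2 S_k, up to rounding error.
contracts = all(history[k + 1] <= h_squared * history[k] + 1e-12
                for k in range(len(history) - 1))
```

Since $s_k = S_k^{1/2}$ then decreases at least like $h^k$, the tail sums of $s_k$ are bounded by a geometric series, which is what the estimate for Method (a) uses.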


Now consider Method (b). We note that $S_k \to 0$ as $k \to \infty$, since
a quasicyclic restricted Jacobi method is convergent. Since $A$ has $n$
distinct eigenvalues, there exists a smallest integer $p_1$ such that

Min  ( (e^l,  \|f)  |0^(  (k  6  p^)  , 

and by (1.14) the restricted method behaves like an ordinary quasicyclic
method from the step $p_1$ on. There also exists a smallest integer

P2  36  Pi  ^  "^  (^  S  Pg)  and  hence  (I.I5)  holds  for  k  S;  Pg  . 

Since the sequence $\{S_k\}$ is monotone decreasing and $\to 0$, there is a
smallest integer $p_3$ such that (4.3) is satisfied, viz.

d 

^  s  ^  (k  g  Pg)  , 

and therefore (4.11) holds for $k \ge p_3$. Finally, a smallest integer
$p_4$ exists such that

(kSP.),  (4.22) 

where  g  is  the  coefficient  of  in  (1*15 )•  Now  let 

$$p = \max(p_2, p_3, p_4). \tag{4.23}$$

For $k \ge p$ we have that (1.15) and (4.11) hold, hence
^  IbO  JuO  »

8’^ 

*  7  UMo  +  +  ...) 


+  (Pp+1  +  8H=+3.  +  +  ...)  4  ... 

■*■  ®Pp+K-i  ■*■  ®**^P+K-1  ‘  *  ■  ^^  > 


dg  Vl-gu.  1-6 


'gU 


PH 


^Pp+K-i 

1-gp, 


P+K-i 


and  using  (4.22)  we  get 


4/2 


*'2(®k)  S~(Pp  +  Pp+i  +  •  •  •  +  PpfK-1^  • 


(4.24) 


Since $S_k$ is monotone decreasing,

i  —  /-  JL. 

PfHv  ^  S  S’J  S  /Npp  S  yg  Pp  (v  «=  0,  1,  2,  .  . . ), 

we combine this with (4.24) to obtain


NaCEk)  ^  ^ 


kcp 


(4.25) 


Now combining (3.3), (4.12) and (4.25) we get (4.16), and since
$\mu_m \to 0$ $(m \to \infty)$ this proves the convergence of (1.7) for Method (b).


Turning our attention to Method (c), we see by (1.16) that $U_k = I$,
$E_k = 0$ whenever $|a_{ij}^{(k)}| < t_\nu$. We take the index $k$ of the
process (1.1) to count only when a non-trivial transformation is made,
i.e., whenever $|a_{ij}^{(k)}| \ge t_\nu$. We let the scalar $\alpha$ in
(1.17) be equal to $\mu_0$ so that

$$|a_{rc}^{(0)}| \le \mu_0 = \alpha q^0 = t_0 \quad (r \ne c).$$

There may be no non-trivial transformations made at any given threshold
level, but if there are, we can easily bound their number. Assume that
after the $r$-th transformation $\mu_r < t_{\nu-1}$. Then

$$S_r < 2N t_{\nu-1}^2, \tag{4.26}$$

and by (4.18)

$$S_{k+1} \le S_k - 2t_\nu^2 \tag{4.27}$$

holds for each transformation made at the $\nu$-th threshold level, since
$|a_{ij}^{(k)}| \ge t_\nu$ there. Combining (4.26) and (4.27) we see that

$$S_{r+\rho} \le 2N t_{\nu-1}^2 - 2\rho\, t_\nu^2$$

holds after $\rho$ transformations at the $\nu$-th level. Letting $[x]$ denote
the integral part of $x$, we have that there are at most $\sigma$
steps at the $\nu$-th level, for

$$S_{r+\sigma} < 2N t_{\nu-1}^2 - 2\sigma\, t_\nu^2 < 2 t_\nu^2,$$

so that $\mu_{r+\sigma} < t_\nu$, i.e., the next threshold level has been reached.
From (1.17)


We  have  proved  that 


so  that,  using  (I.I7) 


^  =  a  (v  =  1,  2,  ...) 


S  2Nt2  (v  =  1,  2,  ...)  , 


va 

Now, since $s[t/s] \le t$ for all positive integers $s$ and $t$, we may
write

00  CQ 

^  s'k  ^  ^  s'^rki  ^ 

k=m  ^  “  k=m  (jjilJ 
00 

g  s^rki 

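Let $i$ be one of the integers $0, 1, \ldots, \sigma - 1$ and let

$$k = \sigma\left[\frac{m}{\sigma}\right] + i + j\sigma.$$

We have

$$\left[\frac{k}{\sigma}\right] = \left[\frac{m}{\sigma}\right] + j + \left[\frac{i}{\sigma}\right],$$

and since $i < \sigma$, we have

$$\left[\frac{k}{\sigma}\right] = j + \left[\frac{m}{\sigma}\right] \quad (i = 0, 1, \ldots, \sigma - 1).$$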


Thus we may write (4.29) as follows

0—1  00 


Sk  S  2Z  S  / ,  .nn"|N  f 

krtn  i*0  JbO  ^LoJ' 

00 
(4.30)


and using (4.28) we have


t  s?  *  o(2%j'‘  f 


k«<n 


J«0 


oq 


“]  1 


Finally, combining (4.1) with (4.30) we get immediately the expression
(4.17). Obviously,

$$q^{[m/\sigma]} \to 0 \quad (m \to \infty),$$

and thus the product (1.7) converges also for Method (c). This
completes the proof of Theorem 3.


5. REMARKS. If we refer to unitary matrices of the general type used
in the process (1.1) as elementary unitary matrices, then it is a
rather easy matter to show that an arbitrary unitary matrix can be
decomposed into a product of a finite number of elementary unitary
matrices. In [3] (footnote (5), page 8) Givens gave a proof that every
orthogonal matrix with positive determinant can be expressed as a
product of a finite number of elementary orthogonal matrices. We shall
not give the details here, but a proof of the corresponding result on
the decomposition of an arbitrary unitary matrix can be constructed by
a method very much the same as that used in [3].

In spite of the result just mentioned, it is perhaps not without
some interest that we can use Theorem 3 to prove that an arbitrary
unitary matrix can be expressed as an infinite product of elementary
unitary matrices. The decomposition is, of course, not unique. We
sketch a proof as follows.

Suppose $W$ is any $n \times n$ unitary matrix. Let $D$ be a fixed
$n \times n$ diagonal matrix with $n$ distinct and real diagonal elements.
Let the Hermitian matrix

$$A = W D W^* \tag{5.1}$$

be diagonalized by (say) Method (a) of Theorem 3, so that

$$U^* A U = D',$$

where $U$ is the limit of the infinite product (1.7). The matrix $D'$
has the same diagonal elements as $D$ but possibly with a different
arrangement. By a finite number of elementary unitary transformations
with $V_k$ of the form


we can transform $D'$ into $D$, so that

$$V^* A V = D, \tag{5.2}$$

where $V$ is an infinite product of elementary unitary matrices. From
(5.1) and (5.2) we see that

$$V D V^* = W D W^*. \tag{5.3}$$

It is easy to show from (5.3) that $W$ can be expressed as $VY$ where
$Y$ is a diagonal unitary matrix, i.e.

$$y_{rr} = e^{i\alpha_r} \quad (r = 1, 2, \ldots, n). \tag{5.4}$$

Since any diagonal matrix $Y$ satisfying (5.4) can be decomposed as a
product of at most $n - 1$ elementary unitary matrices with $V_k$ of the
form


the desired result is established.
ABSTRACT


A method of syntactic analysis of declarative English sentences, proposed
by Bar-Hillel and Lambek, was operationally extended in such a manner that
the product of the analysis is a machine retrieval language. Syntactic
analysis thus becomes a recursive translation from the natural language to
the proposed machine language. This method is called "microsyntactic analysis"
and the smallest meaningful linguistic units, or morphemes, of the retrieval
language are termed "microsyntactic indices."

A three-value storage logic was devised for storage and retrieval operations.
Various relations between syntactic strings and algebraic strings were
defined. A characterization of semantic correspondences in terms of algebraic
relations was proposed and is currently under study.
Section 1


INTRODUCTION

The first quarter of research on this contract resulted in a proposal
for studying word correlation in a sublanguage of English and presented
a method by which correlations could be computed for the given
sublanguage. The method of syntactic analysis conceived first by
Y. Bar-Hillel [5] and studied by J. Lambek [6] was arbitrarily extended
to a larger part of the language than intended by these authors. This
extension, termed by us "macrosyntax" because it provides a basis for
classification of lexical units into function-sets, may not be justified
on logical grounds; it was taken only as a working basis for further
research.

The second and third quarters of research resulted in a refinement of
syntactic analysis called "microsyntax." The chief advance consisted
in the abstraction of syntactic forms from semantic and/or functional
interpretations given them in the macrosyntax, and opening the way to
mathematical characterizations of syntactic forms and their
construction. Microsyntactic indexing was also introduced, providing a means
for very fine semantic distinctions between words whenever such
interpretations are desired by the analyst. Concurrently with these studies,
we made serious efforts to find number-theoretic analogues of syntactic
structure in the hope that syntactic analysis could be accomplished on
standard digital computers. In this we were only partially successful,
and these efforts had been abandoned at the close of the third quarter.

A list of basic English words was abstracted from West [7], their
syntactic forms listed, and frequencies of occurrence per million
computed from data given by West [7]. This list was computer-programmed
for future work; the program is described in the appendix of this report.

The fourth quarter of research has resulted in the conception and outline
of a storage algebra of microsyntactic indices. The most important result
of this work is the characterization of semantic correspondences in terms
of relations defined in this algebra. We feel that this characterization
gives us a firm basis for the derivation of a correlation function.
Section 2

A GENERALIZED SYNTAX FOR L


In [5] we presented a method for indexing strings of words which are
analyzable in terms of a microsyntax of English. This method has been
generalized and is presented here in a more systematic form. In the
reference cited we defined heuristically the order of a syntactic form
as the highest number of strokes separating elements of the form. It
will be observed that forms of order greater than zero are constructed
by repeated application of left or right stroke functions. Thus, we
need only two mappings and the concept of order of syntactic forms to
characterize all possible constructions. In [2], when we defined the
microsyntax of L, we defined two sets of elements, A = {n_1, s_1, e_1} and
B = {n_2, s_2}, and constructed all elements of the microsyntax from
these sets by means of appropriate mappings. The indicial feature of
this method was extended in [3], and the method of microsyntactic indexing
was introduced. In the process of systematizing the method of syntactic
analysis originally proposed by Bar-Hillel in [5], we have abstracted the
constructive properties of the method and discarded the interpretations
of strings of these elements as "noun phrases," "verb phrases," "sentences,"
etc. These interpretations may serve a convenient classification purpose
in descriptive semantics, but they are meaningless syntactically. In
microsyntactic indexing, n and s serve an indicial function only, and
may be replaced by any convenient symbol. Let Z be the set of
zero-order indices. Indices of higher order may, in general, be either
entirely associative or non-associative between certain elements. We
denote by X_n the set constructed by n consecutive applications of the
left stroke function on elements of Z; thus, X_1 is the set of elements
of the form a\b, X_2 = {a\b\c}, etc. Similarly, we denote by Y_m the
set of elements formed by m successive applications of the right stroke
function. All members of X_n and Y_m are first-order and entirely
associative. Forms containing elements which are non-associative
relative to a left or a right stroke are constructed from the sets X_n and
Y_m. These forms are the second-order forms, and are in turn used in the
construction of third and higher order forms. We denote the many
possible sets of second-order forms by the letter X or Y carrying a
subscript sequence and a paired superscript sequence, where X
or Y indicates that the left or right stroke, respectively, is the stroke
function relative to which the elements are non-associative; and the
elements of the superscript sequence indicate the number of right or left
strokes, respectively, in the forms following the forms indicated in the
paired subscript sequence. For example, the X sets {a\b\\c/d/e\\f} and
{a/b\\c\d\\e}, and the Y sets {a\b//c\d} and {a/b//c\d//e\f}, etc.

Most of the possible second-order forms do not occur in practice; among
these are the forms which are non-associative relative to both stroke
functions. Where such forms should be necessary, and for forms of higher
order, it is notationally simpler to write the required sets in terms of
the left and right strokes, as was done in the above examples. A slightly
more cumbersome notation is to define the stroke functions as functions of
ordered pairs of sets (as we have implicitly done above), then write the
actual function as required. For example, an element belonging to the
Y set {a\b//c\d} would be written ρ[λ(a,b), λ(c,d)], where λ and ρ are
the left and right stroke functions respectively, and a, b, c, d are all
elements of Z. Some convention must be used to characterize association
between elements; capital Greek letters, Λ and Ρ, may be used for this
purpose.

We require an identity element e ∈ Z, such that

    λ(e,a) = ρ(a,e) = a

where a is an element of any set. Sequences of indices are said to
"reduce" to an index when any of the following sequences occur:

    λ(a,b) λ(b,c) = λ(a,c)
    Λ(a,b) Λ(b,c) = Λ(a,c)
    ρ(a,b) ρ(b,c) = ρ(a,c)
    Ρ(a,b) Ρ(b,c) = Ρ(a,c)

where a, b, c are elements of any set. We define a "syntactic string"
as any ordered sequence of indices. A string is said to reduce to an index
when sequential pairs of indices reduce according to the transitive
property of the stroke functions given above. Conversely, an expansion of
any index is any string obtained by the converse application of the
transitive property, providing the indices used in the expansion are members
of sets required for analysis of the language. In particular, if q is
any index, we denote by H q the set of allowable expansions of q.
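The reduction rule can be made concrete with a small sketch. It assumes an index is represented as a triple (stroke, left argument, right argument), and that two adjacent indices reduce only when they carry the same stroke function and the right argument of the first equals the left argument of the second. The representation and the names `Index` and `reduce_string` are illustrative assumptions, not notation from the report.

```python
from typing import List, Tuple

# (stroke, left argument, right argument); the stroke is one of
# "lambda", "Lambda", "rho", "Rho" in this hypothetical encoding.
Index = Tuple[str, str, str]

def reduce_string(string: List[Index]) -> List[Index]:
    """Apply stroke(a,b) stroke(b,c) = stroke(a,c) to adjacent pairs, left to right."""
    out: List[Index] = []
    for idx in string:
        # Merge with the previous index when the transitive property applies.
        if out and out[-1][0] == idx[0] and out[-1][2] == idx[1]:
            stroke, left, _ = out.pop()
            idx = (stroke, left, idx[2])
        out.append(idx)
    return out

chain = [("lambda", "a", "b"), ("lambda", "b", "c"), ("lambda", "c", "d")]
mixed = [("lambda", "a", "b"), ("rho", "b", "c")]

reduced_chain = reduce_string(chain)   # a single index lambda(a, d)
reduced_mixed = reduce_string(mixed)   # unchanged: the strokes differ
```

A string reduces to an index, in the sense of the text, exactly when the returned list has length one; an expansion is the same process read in reverse.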
Section 3

A STORAGE-DEPENDENT ALGEBRA OF INDICES

The central idea in developing a storage-dependent algebra of indices is
the assignment of values to indices and algebraic strings of indices
according to the results of a storage-verification procedure. We call
the values thus assigned "s-values" for the indices, and denote them by
"0," "1," and "θ." It is assumed that the storage of indices is a
continually changing storage, but that storage-verification occurs for
any fixed interval while the storage is static. The s-values 0, 1 are
interpreted to mean negative and positive results of the verification
procedures, while θ is interpreted as the (intentional or unintentional)
non-application of the verification procedure.

The stipulation of s-values for algebraic strings of indices is to some
extent arbitrary, depending upon the purpose of the algebraic analysis.
Once this purpose has been clarified, then the algebraic relations among
indices can be defined in terms of assignments of s-values for the
primitive relations. For the heuristic reason that, for a sufficiently
large collection of sentences in storage, the negations of a large number
of words and phrases are likely also to be in storage, we define, for
each index p in L, an index -p in L such that the following
conditions of storage verification hold: if p = 1, -p = 1; if p = 0,
-p = θ; and if p = θ, -p = 0. The element -p is the antonym of p
in a descriptive semantics.

We define a binary relation D between indices of L in terms of the
following stipulation of s-values.
    p    q    pDq

    0    0     0
    0    1     1
    0    θ     θ
    1    0     1
    1    1     1
    1    θ     1
    θ    0     θ
    θ    1     1
    θ    θ     θ

We also define a binary relation C among indices of L such that C
satisfies the identity pCq = -(-pD-q). From the definitions it follows
that pCq satisfies the following table of s-values.

    p    q    pCq

    0    0     0
    0    1     1
    0    θ     0
    1    0     1
    1    1     1
    1    θ     1
    θ    0     0
    θ    1     1
    θ    θ     θ

Examination of the s-table for pDq shows that

    pDq = qDp .                        (1)

Further, from the definition for -p, we have

    -(-p) = p .                        (2)

Using the definition for C and (1), we have

    pCq = qCp .                        (3)

Through the s-tables, it can be shown that

    pD(qDr) = (pDq)Dr ,                (4)

and it is easily proved that

    pC(qCr) = (pCq)Cr .                (5)

Again through the s-tables, the following property may be verified:

    pC(qDr) = (pCq)D(pCr) .            (6)

We have shown that, for the given stipulation of s-values, the relations
C and D are commutative and associative, and that C is distributive
over D in the sense of (6).
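Properties (1) through (6) can be checked exhaustively, since there are only three s-values. The sketch below is one way to do so, assuming the negation and the D table exactly as stipulated above, with the third s-value written as the string "theta"; the dictionary and function names are hypothetical.

```python
from itertools import product

ZERO, ONE, THETA = "0", "1", "theta"
VALUES = (ZERO, ONE, THETA)

# Negation as stipulated: -1 = 1, -0 = theta, -theta = 0.
NEG = {ONE: ONE, ZERO: THETA, THETA: ZERO}

# s-table for D, row by row as in the text.
D_TABLE = {
    (ZERO, ZERO): ZERO, (ZERO, ONE): ONE, (ZERO, THETA): THETA,
    (ONE, ZERO): ONE, (ONE, ONE): ONE, (ONE, THETA): ONE,
    (THETA, ZERO): THETA, (THETA, ONE): ONE, (THETA, THETA): THETA,
}

# s-table for C as printed, used to check the identity pCq = -(-pD-q).
C_TABLE = {
    (ZERO, ZERO): ZERO, (ZERO, ONE): ONE, (ZERO, THETA): ZERO,
    (ONE, ZERO): ONE, (ONE, ONE): ONE, (ONE, THETA): ONE,
    (THETA, ZERO): ZERO, (THETA, ONE): ONE, (THETA, THETA): THETA,
}

def neg(p):
    return NEG[p]

def d(p, q):
    return D_TABLE[(p, q)]

def c(p, q):
    return neg(d(neg(p), neg(q)))  # definition of C in terms of D

pairs = list(product(VALUES, repeat=2))
triples = list(product(VALUES, repeat=3))

c_matches_table = all(c(p, q) == C_TABLE[(p, q)] for p, q in pairs)
involution = all(neg(neg(p)) == p for p in VALUES)                           # (2)
commutative = all(d(p, q) == d(q, p) and c(p, q) == c(q, p)
                  for p, q in pairs)                                         # (1), (3)
associative = all(d(p, d(q, r)) == d(d(p, q), r) and
                  c(p, c(q, r)) == c(c(p, q), r) for p, q, r in triples)     # (4), (5)
distributive = all(c(p, d(q, r)) == d(c(p, q), c(p, r))
                   for p, q, r in triples)                                   # (6)
```

One way to read the two tables: D picks the larger of its arguments under the ordering 0 < θ < 1, while C picks the larger under θ < 0 < 1, which is why both are commutative and associative.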


Algebraic strings are linearly related to the stroke functions λ and
ρ. Let x, y, z be any indices such that, if α denotes either λ or
ρ, α(x,y), α(x,z), α(y,z) are all indices in L. Then we define:

    α(xCy,z) = α(x,z)Cα(y,z)           (7)
    α(x,yCz) = α(x,y)Cα(x,z)           (8)
    α(xDy,z) = α(x,z)Dα(y,z)           (9)
    α(x,yDz) = α(x,y)Dα(x,z)           (10)

Since C and D are commutative, we have

    α(xCy,z) = α(yCx,z)                (11)
    α(x,yCz) = α(x,zCy)                (12)
    α(xDy,z) = α(yDx,z)                (13)
    α(x,yDz) = α(x,zDy)                (14)

We define

    α(-x,z) = α(x,-z) = -α(x,z)        (15)

and show that this definition is consistent with (7) - (10).
    α(xCy,z) = α[-(-xD-y),z]
             = -α(-xD-y,z) = -[α(-x,z)Dα(-y,z)]
             = -[-α(x,z)D-α(y,z)] = α(x,z)Cα(y,z),

and similarly for α(x,yCz). From (15), we have the identities:

    α(xCy,z) = α(-xD-y,-z)             (16)
    α(xDy,z) = α(-xC-y,-z)             (17)
    α(x,yCz) = α(-x,-yD-z)             (18)
    α(x,yDz) = α(-x,-yC-z)             (19)

Further, from (7) - (10) and associativity, we have

    α(uCv,xCy) = α(u,x)Cα(u,y)Cα(v,x)Cα(v,y)     (20)
    α(uDv,xDy) = α(u,x)Dα(u,y)Dα(v,x)Dα(v,y)     (21)


Indices containing both C and D are allowable if and only if all
indices in the resulting decomposition have the same s-value. To prove
this take, for example, the index α(uCv,xDy). We have

    α(uCv,xDy) = α[-(uCv), -xC-y] = -α(uCv, -xC-y)
               = -[-α(u,x)C-α(u,y)C-α(v,x)C-α(v,y)]

On the other hand, we also have

    α(uCv,xDy) = α[-uD-v, -(xDy)] = -α(-uD-v, xDy)
               = -[-α(u,x)D-α(u,y)D-α(v,x)D-α(v,y)]

The two decompositions can be equal only if all indices in the strings
are either 0, θ, or 1.

From the transitive property of the stroke functions, we have

    α(xCy,z)α(z,u) = α(xCy,u)          (22)

Since α(xCy,z) = α(x,z)Cα(y,z), the sequence of indices must be
distributive with respect to C and D; thus


LOCKHEED  AIRCRAFT  CORPORATION 


MISSILES  and  SPACE  DIVISION 


    α(xCy,z)α(z,u) = [α(x,z)Cα(y,z)]α(z,u)
                   = α(x,z)α(z,u)Cα(y,z)α(z,u) = α(x,u)Cα(y,u)
                   = α(xCy,u)

However, α(u,xCy)α(xCy,v) = α(u,v) if and only if x = y, and similarly
if C is replaced by D in one or both indices.

Using (2) and (15), we have

    α(-x,y)α(y,-z) = α(x,-y)α(-y,z)
                   = [-α(x,y)][-α(y,z)] = α(x,z)         (23)

So far, we have not used any property of θ in our results, and hence
the definitions and consequent properties of both algebraic and syntactic
strings hold for a Boolean algebra as well. An interesting application
of these results would be the analysis of the predicate calculus in terms
of this theory; however, we have only fragmentary results on this
application, and shall not include them here.
Section 4

SYNTACTIC THEORY OF DEFINITIONS


It was proposed, in the first report on this study [1], to compile a
list of words to be taken as undefined and define all other words in the
general lexicon in terms of the primitive words. The novel feature of
our proposal was to define words in such a manner that the string of the
definition is an allowable expansion of the syntactic form of the word
defined. This feature would permit free substitution of definitions in
the context of the words defined, and hence would permit correlation of
sentences. The theory of definition is the heart of descriptive
semantics, and it is in this area of research that any method or proposal for
word correlation must be conceived. As was indicated in [4], we have
encountered considerable difficulty in writing satisfactory definitions
within the proposed syntactic restrictions. A "satisfactory" definition
is simply a matter of the conscience and experience of the person
writing the definition, and part of the difficulty lies herein: we have no
criteria for an acceptable definition. The other part of the difficulty
is that the theory of definition at this level is not properly a
scientific problem, even for those who choose to recognize it as a problem.
It is a philosophical problem in the present state of our knowledge of
human behaviour and communication, and subject to endless debate.

The purpose of a definition in a natural language is to establish a
correspondence between a word and whatever the word names or whatever
function it serves in the language. The manner in which this purpose
is accomplished is by stating a correspondence between the word and a
collection of other words in the language, placing reliance on the human
interpreter to perform the actual process of naming objects of experience,
and whatever other functions are required. Since the construction of a
dictionary or thesaurus involves these complex human activities to a
sophisticated degree, and since in our present state of knowledge these
activities are so little understood, the proper objects of our research
are simply those correspondences between words and collections of words
which are already established in conventional dictionaries, rather than
the construction of a new dictionary. We have limitations imposed by the
method of syntactic analysis, of course, and these restrictions will
induce changes in the precise wording of definitions. But we can study
the problems in abstracto, and the results obtained ought to tell us
whether any useful concept of word correlation is feasible.

Generally, a definition is a correspondence between a syntactic string
and a sequence of syntactic strings. We consider here the correspondence
between a single word and a single string, and write p > σ for the
correspondence, where p is the word defined and σ the string
constituting the definition. p denotes a set of indices each of which
stands in the given relation to the string σ. Since we have no
criterion for substitutability of σ in any string containing an index
belonging to p, the relation > is not an equivalence relation. We
define the equivalence relation = between indices as follows:

    p = σ if and only if p > σ and σ > p ,

the relation holding for all indices belonging to p. We define the
relation > in terms of the storage-dependent algebra:

    p > σ = -pDσ                       (24)

The relation > may hold for any two indices in L. In the thesaurus
type of definition, where a word is "defined" in terms of a collection
of words, the relation holds between the word defined and each word in
the collection. From the definitions, we have the following table of
s-values for > :
    p    q    p > q

    0    0     θ
    0    1     1
    0    θ     θ
    1    0     1
    1    1     1
    1    θ     1
    θ    0     0
    θ    1     1
    θ    θ     θ

The first set of values reflects the fact that the semantic correspondence
between any two indices does not depend upon the storage of the indices.
Further, if q = σ, a string, then the correspondence may implicitly be
in storage through the words in σ, though σ itself need not be in
storage.

The semantic correspondence relation has the following elementary
properties:

    p > q = -(pC-q)                    (25)
    α(x,y > z) = α(x,y) > α(x,z)       (26)
    α(x > y,z) = α(x,z) > α(y,z)       (27)

The form α(x > y,u > v), if it should occur, implies x = y and
u = v; this is easily seen by application of (26) and (27).
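The table for > and property (25) both follow mechanically from (24) and the stipulated negation. The sketch below recomputes them; it assumes the negation -1 = 1, -0 = θ, -θ = 0 and reads D as the larger of its arguments under the ordering 0 < θ < 1, which reproduces the D table of Section 3. The string "theta" and all function names are illustrative.

```python
from itertools import product

ZERO, ONE, THETA = "0", "1", "theta"
VALUES = (ZERO, ONE, THETA)

NEG = {ONE: ONE, ZERO: THETA, THETA: ZERO}   # stipulated negation
RANK = {ZERO: 0, THETA: 1, ONE: 2}           # ordering under which D is a maximum

def neg(p):
    return NEG[p]

def d(p, q):
    return p if RANK[p] >= RANK[q] else q

def c(p, q):
    return neg(d(neg(p), neg(q)))            # pCq = -(-pD-q)

def corresponds(p, q):
    """The relation p > q of (24): -p D q."""
    return d(neg(p), q)

# s-table for > as printed in the text.
TABLE = {
    (ZERO, ZERO): THETA, (ZERO, ONE): ONE, (ZERO, THETA): THETA,
    (ONE, ZERO): ONE, (ONE, ONE): ONE, (ONE, THETA): ONE,
    (THETA, ZERO): ZERO, (THETA, ONE): ONE, (THETA, THETA): THETA,
}

pairs = list(product(VALUES, repeat=2))
table_matches = all(corresponds(p, q) == TABLE[(p, q)] for p, q in pairs)
property_25 = all(corresponds(p, q) == neg(c(p, neg(q))) for p, q in pairs)
```

Note that the value 0 arises only in the row p = θ, q = 0.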

The purpose of the three-valued algebra we have outlined is to provide
a basis for semantic correlation relative to a given set of
correspondences between words and strings of words. It is our intention to
define the s-value θ in such a manner that it provides a numerical
measure of correlation relative to the given correspondences. This
function will also provide a basis for semantic inference in L relative
to that sub-language of L which is in storage. Our speculations
regarding the form of this numerical function center around statistical
properties of indices subject to the syntactic restraints of acceptable
strings and to the imposed structure of the storage algebra. Results
obtained at this time are too indefinite for inclusion in this report.
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Section  5 

TECHNICAL  SUMMARY  AND  CONCLUSIONS 

We present in this section a summary of the more important technical
results obtained for this contract.

The subject of analysis is the English language which occurs in
meaningful discourse; in particular, the analysis of meaningful declarative
sentences. The syntax of this language was analyzed by the method
proposed by Bar-Hillel [5] and Lambek [6]. The result was a sublanguage of
the original language consisting of sentences for which the method
applies and the words which occur in those sentences. This method of
analysis was extended operationally to include a larger set of sentences
for experimental purposes. This extension is termed "macrosyntax" in our
reports.

One of the weaknesses of the macrosyntax is the ambiguity of association
in the more complex forms. For example, the form n\s/n can be taken
as either of the forms (n\s)/n or n\(s/n). A resolution of this
ambiguity was found by defining two semigroups (A,/) and (B,\), with
$A = \{n_1, s_1, e_1\}$ and $B = \{n_2, s_2, e_2\}$ , then defining mappings which
carry cross-products of the two sets into sets whose members are the
non-associative forms. This syntax was termed "microsyntax."

It was then recognized that the indicial feature of the microsyntax could
be used to index words in context. Thus, $n_1$, $n_2$, ..., $s_1$, $s_2$, etc.
are all sets whose elements are words or strings of words in
the natural language used in particular sentences with meanings intended
in those sentences. This method of indexing was called "microsyntactic
indexing," and represents a recursive translation of the natural
language into a machine language suitable for storage and retrieval purposes.
The analysis of the syntax of the natural language, with this recursive
translation into the microsyntactic indices, was felt to be an essential
part of the groundwork for word correlation. This is because the meanings
of words depend upon their usage and functions in context, and a method
for preserving context while simultaneously permitting algebraic operations
with individual words was essential. We feel that microsyntactic
indexing satisfies this technical requirement.

An algebra suitable for operations among indices in storage is outlined in
this report. A three-value algebra was chosen so that one of the values
could be used as a correlation function between any pair of indices.
Relations between the stroke functions of the microsyntax and the algebraic
relations were defined and simple consequences of the definitions were
derived. A characterization of the correspondence between a word and a
string which defines the word was proposed and defined in terms of the
algebra. This is felt to be the most important technical result of our
research, as it is through the analysis of these correspondences that the
theory of word correlation must be constructed.

To each word in the natural language there corresponds a set of
microsyntactic indices, at most one index for each occurrence of the word in the
material analyzed. In addition, each sentence which is analyzable in the
microsyntax is indexed. The storage can be manipulated at will using a
Boolean algebra; for example, various hypotheses could be constructed and
consequences of the hypotheses derived automatically by machine. However,
the technique of using a Boolean algebra presupposes a precise knowledge
of the contents of the storage, and when storage is large, the Boolean
algebra becomes impractical. An automatic program for word correlation
is essential as a guide for the analyst. If a correlation function were
available its application would tell the analyst in which direction to
proceed in storage manipulation without reading out the entire storage.

In other words, the correlation function serves as a tool for plausible
inference; from the value of the correlation function in each particular
instance the analyst is able to infer the contents of storage and
construct the Boolean search function accordingly. These are the reasons
we have chosen a three-value storage algebra.

It  is  unclear  at  present  what  form  the  correlation  function  must  have  to 
be  useful  in  the  intended  way.  It  must  be  independent  of  the  cardinality 
of  index  sets  corresponding  to  words,  yet  it  seems  reasonable  that  the 
function  be  related  to  the  probability  that  a  given  word  is  in  storage. 

On the other hand, since we are seeking a semantic correlation function,
we have no a priori reason to assume that probability enters the picture
at all. A scheme for computing word correlation similar to the simple
ideas originally proposed in [1] may serve the intended purpose, with
suitable criteria for substitution.




Section 6
REFERENCES 


1. Lockheed Missiles and Space Division, Word Correlation Study, I, by
R. P. Mitchell and E. Greer, Contract No. AF 30(602)-1689, Sunnyvale,
Calif.

2. Lockheed Missiles and Space Division, Word Correlation Study, II, by
R. P. Mitchell, Contract No. AF 30(602)-1689, Sunnyvale, Calif.

3. Lockheed Missiles and Space Division, Word Correlation Study, III, by
R. P. Mitchell, Contract No. AF 30(602)-1869, Sunnyvale, Calif.

4. Lockheed Missiles and Space Division, Proposal for Continuation of
Word Correlation Study, Contract No. AF 30(602)-1889, LMSD-49901,
Sunnyvale, Calif., 27 Jul 1959.

5. Bar-Hillel, Y., "A Quasi-Arithmetical Notation for Syntactic
Description," Language, pp. 47-58, 1953.

6. Lambek, J., "The Mathematics of Sentence Structure," American
Mathematical Monthly, pp. 154-170, Mar 1958.

7. West, M. P., A General Service List of English Words, New York,
Longmans, Green and Co., 1957.




Appendix  A 

WORD  CORRELATION  STUDY  PROGRAM 


General Description:

Given a list of words with their frequencies of occurrence in five
million words and the percentages of occurrence as different syntactic
types, this program will produce a readable list consisting of the words,
their frequencies of occurrence in one million words, and the frequencies
of occurrence as the different syntactic types. Where the percentage is
unknown, the program will provide the number - 1111 1111 - 11 as an
indicative symbol that the word is known to occur as a certain syntactic
type, but the programmer must put the symbol in the desired location in
the output buffer. See Note 2.
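
The conversion described above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the original program: the function names and the use of `None` for an unknown percentage are assumptions, and the sketch assumes that each per-type frequency is the per-million frequency multiplied by the corresponding percentage.

```python
def per_million(freq_in_five_million):
    """Scale a count observed in five million words to a rate per million words."""
    return freq_in_five_million / 5.0


def type_frequencies(freq_in_five_million, percentages):
    """Split the per-million frequency of a word among its syntactic types.

    `percentages` maps a type code to the percentage (0-100) of occurrences
    of the word as that type, or to None where the percentage is unknown.
    Unknown entries are passed through as None so that the caller can
    substitute an indicative symbol afterwards.
    """
    base = per_million(freq_in_five_million)
    return {
        code: (None if pct is None else base * pct / 100.0)
        for code, pct in percentages.items()
    }
```

For example, a word counted 2500 times in five million words has a frequency of 500 per million; if 60 percent of its occurrences are of one type, that type receives a frequency of 300 per million.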

The program requires all of first core and up to location 10240 of second
core. The remainder of second core may be used to enlarge the program or
make comparisons between the words and/or numbers associated with them.

Output Format:

Each 'page' is headed by the page number and five words. The syntactic
types are represented by numbers from 30 to 60, and appear to the left.
Each word heads a column of numbers, the first of which is the frequency
per million words, and the others, opposite the syntactic types,
represent the occurrence of the word as the various types.

Input:

Input to the program is by tape. For every page, the five words to be
printed on that page must be punched on one card in the following format:

column 1    begin first word
column 19   second word
column 37   third word
column 55   fourth word
column 73   fifth word

(Note: If the fifth word has more than eight letters, start it to the
left of column 73 by the number of letters in excess of eight; e.g., a
nine-letter word must start in column 72, and a twelve-letter word
should start in column 69. If possible, words with more than twelve
letters should not be the fifth word on a page.)
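
The word-card layout can be illustrated with a short Python sketch. It assumes the five fields start at columns 1, 19, 37, 55, and 73 (the even spacing implied by the nine-letter and twelve-letter examples) and an 80-column card; the names `FIELD_STARTS`, `fifth_word_start`, and `punch_word_card` are hypothetical and are not part of the original program.

```python
FIELD_STARTS = (1, 19, 37, 55, 73)  # assumed 1-based starting columns of the five fields
CARD_WIDTH = 80                     # assumed standard 80-column card


def fifth_word_start(word):
    """Starting column of the fifth word, moved left by its excess over eight letters."""
    excess = max(0, len(word) - 8)
    return FIELD_STARTS[4] - excess


def punch_word_card(words):
    """Return an 80-character card image holding five words in their fields."""
    if len(words) != 5:
        raise ValueError("exactly five words are required per card")
    card = [" "] * CARD_WIDTH
    starts = list(FIELD_STARTS[:4]) + [fifth_word_start(words[4])]
    for start, word in zip(starts, words):
        for offset, ch in enumerate(word):
            col = start - 1 + offset          # convert to a 0-based index
            if col >= CARD_WIDTH:
                raise ValueError("word runs past the end of the card")
            card[col] = ch                    # no check for overlap between fields
    return "".join(card)
```

With these assumptions a nine-letter fifth word starts in column 72 and a twelve-letter word in column 69, matching the note above.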

The numbers associated with every set of five words are punched on cards
separate from the word card. Using the standard VARAB or VARCAR format
(the same), punch the five frequencies, followed by the five percentages
for the first syntactic type, next the five percentages for the second
syntactic type, and so on. Zeros must be punched when a word doesn't
occur as a particular type. For each page of output desired there should
be five cards of numbers for each one card of words. This is for
15 syntactic types. If more types are given, the argument word in
location 00131)B must be changed from 05 00005 00010 to 05 000nn 00010,
where nn is the number, in octal, of cards to be read for each set of
five words.

Operation:

Card-to-tape program deck I; card-to-tape 'data deck II' (the word cards);
card-to-tape 'data deck III' (the number cards). Place II on uniservo
No. 4, III on uniservo No. 5; load octal program tape number. Output is
on uniservo No. 5. The program starts at cell 00250)B and should stop at
05043)B. The original program provides for 100 pages of output, or 500
words. For fewer pages an MJ to 05043)B should be provided at the end of
the number of pages required. This may be accomplished by an octal
corrector with the program deck, in which case the program must be
card-to-taped; or by a four-field octal loader corrector, which must be loaded
after the program tape (before program is started).

Notes:

1. The output buffer consists of an 84 word BUFF and 324 word HOLD. The
syntactic type symbols go to HOLD + 60, HOLD + 66, HOLD + 72, HOLD + 78,
HOLD + 84; HOLD + 96, HOLD + 102, HOLD + 108, HOLD + 114, HOLD + 120,
HOLD + 132, etc. The numbers associated with these types are in locations
called HOLD + 61 through HOLD + 65; HOLD + 67 through HOLD + 71; and so on.

2. If the indicative number - 1111 1111 - 11 is desired, for example,
opposite type 31, under the second word on a page, then TP STAR HOLD + 68
must be inserted by the programmer in the 20 word buffer provided for that
purpose for each page before PREDIT is called to output the page.




EFFICIENT  ESTIMATION  OF 
REGRESSION  PARAMETERS  FOR  CERTAIN 
SECOND-ORDER  STATIONARY  PROCESSES 


C.T.  Striebel 


FOREWORD 


This reprint was published originally
as a technical report, LMSD-288079,
under the same title, November 1959.

ABSTRACT


The problem considered is that of estimating the single regression
parameter k in the presence of a second-order stationary disturbing process
x(t) with rational spectral density. If the process

$y(t) = k\varphi(t) + x(t)$

is observed in the continuous interval $0 \le t \le T$ , a linear unbiased
estimate $\bar{k}^T$ of k is said to be efficient if its variance is
asymptotically minimum among all linear unbiased estimates.


For known mean value functions of the form


$\varphi(t) = t^r e^{at} \sum_{\alpha} \varphi_\alpha e^{i\lambda_\alpha t}$


efficient estimates are given for the two cases a = 0 and a > 0 . They
are shown to be economical of information concerning the spectral density
in that no estimate exists which is efficient for a wider class of spectra.

Section 1

THE  PROBLEM  AND  ITS  BACKGROUND 

Let x(t) be a second order stationary stochastic process with mean
value function zero and covariance

$E[x(t)\,\overline{x(s)}] = R(t-s)$ ,    (1.1)

and suppose that the process

$y(t) = k\varphi(t) + x(t)$    (1.2)

is observed for the continuous time parameter t in the interval
$0 \le t \le T$ . The function $\varphi(t)$ is known, and the unknown parameter k
is to be estimated. This process can be thought to consist of a
systematic component $k\varphi(t)$ , which is completely predictable except for
the unknown scale factor k , plus a random disturbing component x(t) .

The problem is to estimate this scale factor from the observed process.

Only linear unbiased estimates will be considered, and they will be
represented as linear functionals

$\bar{k}^T[y(t),\ 0 \le t \le T]$    (1.3)

which are limits in quadratic mean of finite linear combinations of the
process $y(t_i)$ , i = 1, ..., N , where the arguments $t_i$ are in the
interval $0 \le t_i \le T$ . Such an estimate $\bar{k}^T$ is said to be
asymptotically efficient, or simply efficient, for the problem
$[\varphi(t), R(t)]$ if it satisfies the condition

$\dfrac{\text{variance } \bar{k}^T}{\text{variance } \hat{k}^T} \to 1$ as $T \to \infty$ ,    (1.4)

where $\hat{k}^T$ is the minimum variance linear unbiased estimate of k
computed for the problem determined by the mean value function $\varphi(t)$ and
the covariance R(t) .


Interest in efficient estimates arises from the fact that the "best"
estimate $\hat{k}^T$ is usually very inconvenient. If the estimate $\hat{k}^T$ is
represented by the linear functional $\hat{k}^T[y(t),\ 0 \le t \le T]$ then it must
satisfy the linear equation

$\hat{k}^T[R(t-s),\ 0 \le t \le T] = M^T \varphi(s)$ ,  $0 \le s \le T$ ,    (1.5)

where $M^T$ is a constant to be determined by the condition of
unbiasedness,

$\hat{k}^T[\varphi(t),\ 0 \le t \le T] = 1$ .    (1.6)

For most combinations of functions $\varphi(t)$ and R(t) it is difficult to
exhibit this solution explicitly; and provided it can be exhibited at all,
the solution usually depends on complete knowledge of R(t) and is
cumbersome to compute. Efficient estimates are provided by linear
functionals $\bar{k}^T$ which are in a sense asymptotic solutions to the linear
equation (1.5).

The principal alternative estimate which has been proposed is the
least square estimate,

$\bar{k}^T_{LS} = \int_0^T \overline{\varphi(t)}\, y(t)\, dt \Big/ \int_0^T |\varphi(t)|^2\, dt$ .    (1.7)

This estimate has the advantages that it is easy to compute and requires
no knowledge whatever of the covariance R(t) . Previous work on the
problem of efficient estimates has been primarily devoted to determining
those combinations of functions $\varphi(t)$ and R(t) for which the least
square estimate is efficient.
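
As a numerical illustration of the least square estimate (1.7), the following Python sketch approximates the two integrals by the trapezoidal rule on the interval from 0 to T. The function name `least_square_estimate`, the grid size `n`, and the sample mean value functions are assumptions made for the example; the report itself treats the continuous-time functional, not a discretization.

```python
import cmath


def least_square_estimate(phi, y, T, n=2000):
    """Approximate the least square estimate (1.7) on [0, T].

    phi and y are callables of t returning real or complex values; the
    integrals of conj(phi)*y and |phi|^2 are replaced by trapezoidal sums
    over n equal subintervals (the common step length cancels in the ratio).
    """
    h = T / n
    num = 0.0
    den = 0.0
    for i in range(n + 1):
        t = i * h
        w = 0.5 if i in (0, n) else 1.0      # trapezoid end-point weights
        p = complex(phi(t))
        num += w * p.conjugate() * y(t)
        den += w * abs(p) ** 2
    return num / den


if __name__ == "__main__":
    # Noise-free check: with y(t) = k*phi(t) the estimate reproduces k.
    k = 3.5
    phi = lambda t: cmath.exp(2j * t)
    print(least_square_estimate(phi, lambda t: k * phi(t), T=10.0))
```

In the absence of a disturbing component the estimate returns k itself, which is the discrete counterpart of the unbiasedness condition (1.6); no knowledge of R(t) enters the computation.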


For the Ornstein-Uhlenbeck process, that is,

$R(t) = e^{-\beta |t|}$ ,    (1.8)

and for mean value functions

$\varphi(t) = t^r$ or $e^{i\lambda t}$ ,    (1.9)

where r is a non-negative integer and $\lambda$ is a real frequency, Mann
and Moranda in reference [10] proved that the least square estimate is
efficient. The author in reference [13] extended this result to include
mean value functions of the form

<p(t)  =  (1.10) 


and  showed  further  that  for  the  more  general  function 


9(t)  =  L 


a=i 


(1.11)

where the $\varphi_\alpha$ are non-zero constants and n > 1 , the least square
estimate is not efficient.


For a much broader class of covariance functions R(t) and essentially
the same mean value functions $\varphi(t)$ , this problem was first discussed
by Grenander in reference [3]. Further work was carried out by Grenander
and Rosenblatt in references [4] and [5]. Rosenblatt considered some of
the same problems in the case of vector valued time series in reference
[11] and extended his results in [12]. Most of these results together
with some examples appear in Chapter 7 of reference [6]. In this work
only the discrete parameter case is considered, and the mean value
function $\varphi(t)$ is assumed to be "slowly increasing." This assumption
requires that the function $\varphi(t)$ be essentially of form (1.11). All
restrictions on the class of covariances are imposed on the equivalent
class of spectral densities $f(\lambda)$ , which by assumption exist and
satisfy the relation

$R(t) = \int_{-\pi}^{\pi} e^{it\lambda} f(\lambda)\, d\lambda$    (1.12)

for a discrete parameter process and

$R(t) = \int_{-\infty}^{\infty} e^{it\lambda} f(\lambda)\, d\lambda$    (1.13)

for a continuous parameter process. Thus a problem $[\varphi, R]$ can be
referred to by the equivalent $[\varphi, f]$ . For positive continuous spectral
density and "slowly increasing" mean value function, a necessary and
sufficient condition is given in reference [6] for the least square
estimate to be efficient. The condition, which is the same as that found
in reference [13] for the continuous parameter Ornstein-Uhlenbeck process
and mean value function of the form (1.11), is that $f(-\lambda_\alpha)$ be constant
for $\alpha = 1, \ldots, n$ .

In Chapter I.3 of reference [7] Grenander reproduces a few of these
results using the methods of Toeplitz forms. In Chapter 1.4, under
certain regularity conditions on $f(\lambda)$ , he extends his results to the
continuous parameter case for the single example

$\varphi(t) = 1$ .    (1.14)

With the exception of those in reference [7], all the above mentioned
results are derived for the more general problem

$E\{y(t)\} = \sum_{\nu=1}^{p} k_\nu \varphi_\nu(t)$ ,

where the $k_\nu$ are unknown parameters and the $\varphi_\nu(t)$ are known functions.

For p > 1 , the definition of efficiency used by Mann, Moranda, and
Striebel is different from that used by Rosenblatt and Grenander. For
the case p = 1 , both agree with definition (1.4) made above. In the
present paper only the case p = 1 will be considered though it is
believed that the results obtained could be generalized to larger p
using the methods and definitions of efficiency given by Rosenblatt and
Grenander.

In the present paper $f(\lambda)$ is assumed to be a rational function with no
real zeros. Again it is believed that this requirement could be slightly
relaxed so as to include the case considered in Chapter 1.4 of reference
[7] though this has not been attempted.

Under these two limitations, for the continuous parameter process the
results in references [10], [13], and [7], mentioned above, are special
cases of the theory to be presented here. The continuous parameter
equivalents of the important examples discussed in reference [6] are also
covered.

Section 2
SUMMARY

In the results summarized above it has been shown that in certain
circumstances the least square estimate is efficient for a large class
of spectral densities. It shall be the purpose here to discuss efficiency
classes of spectral densities defined as follows: For a fixed mean value
function $\varphi(t)$ , a class of spectral densities $\mathcal{F}$ and a member $f(\lambda)$ of
$\mathcal{F}$ , the efficiency class $\mathcal{F}(f,\varphi)$ of the density $f(\lambda)$ in the class $\mathcal{F}$
with respect to the mean value function $\varphi(t)$ is defined to be all
members $g(\lambda)$ of $\mathcal{F}$ for which there exists an estimate (not necessarily the
least square) that is efficient for the two problems $[\varphi,f]$ and $[\varphi,g]$ .
Thus for $\varphi(t) = 1$ and a certain class of densities $\mathcal{F}$ , Grenander in
Chapter 1.5 of reference [7] shows that $\mathcal{F}(f,\varphi) = \mathcal{F}$ for all f ,
since in this case he establishes that the least square estimate is
efficient for all members of his class $\mathcal{F}$ .

Section 3 is devoted to establishing the efficiency of certain proposed
estimates (2.10), (2.11), and (2.13). It is also shown that for questions
of efficiency the problem based on the process y(t) for t in the
finite interval $0 \le t \le T$ can be extended to an equivalent problem with
t in the semi-infinite interval $-\infty < t \le T$ . This property is used in
section 4 to apply known results for the latter problem in obtaining a
necessary condition that a density $g(\lambda)$ belong to an efficiency class
$\mathcal{F}(f,\varphi)$ .


The mean value functions $\varphi(t)$ which will be considered are of the form

$\varphi(t) = \sum_{\gamma=1}^{m} \sum_{j=0}^{r_\gamma} \varphi_{\gamma j}\, t^j e^{i\lambda_\gamma t}$ ,    (2.1)

where $r_\gamma$ is a non-negative integer, $\varphi_{\gamma j}$ and $\lambda_\gamma$ are complex,

$\max_{1 \le \gamma \le m} \left[ -\Im(\lambda_\gamma) \right] = a \ge 0$ ,    (2.2)

and $\varphi_{\gamma r_\gamma} \ne 0$ . The spectral densities considered are assumed to be
rational functions with no real zeros. For spectral densities of this
type, the solution of the linear equation (1.5) for the finite interval
$0 \le t \le T$ is outlined by Laning and Battin in Chapter 8.4 of reference
[8]. However, this estimate $\hat{k}^T[y(t),\ 0 \le t \le T]$ will not be used here,
since it will be shown that efficient estimates can instead be compared
with the much more convenient estimate $\hat{k}^T[y(t),\ -\infty < t \le T]$ .

In order to exhibit the estimates whose efficiency is considered in
section 3, some notation must first be developed. The one-sided Laplace
transform,

$\Phi(\lambda) = \int_0^\infty e^{-i\lambda t} \varphi(t)\, dt$ ,    (2.3)

of $\varphi(t)$ given by (2.1) can be written

$\Phi(\lambda) = \sum_{\gamma=1}^{m} \sum_{j=1}^{r_\gamma+1} \dfrac{\Phi_{\gamma j}}{(\lambda - \lambda_\gamma)^{j}}$ ,    (2.4)

where $\Phi_{\gamma, r_\gamma+1} \ne 0$ . Let r be the maximum of $r_\gamma$ among
subscripts $\gamma$ for which $-\Im(\lambda_\gamma) = a$ . Indicate by $\alpha = 1, \ldots, n$ those
subscripts for which $-\Im(\lambda_\alpha) = a$ and $r_\alpha = r$ . It will be seen that for

questions of efficiency only those terms of the mean value function whose
absolute value increases as $t^r e^{at}$ will be pertinent. Thus the mean
value function (2.1) can in effect be replaced by

$\tilde{\varphi}(t) = t^r \sum_{\alpha=1}^{n} \varphi_\alpha e^{i\lambda_\alpha t}$ .    (2.5)

Using the method described by Doob on page 542 of reference [2], the
spectrum can be factored and written in the form

$f(\lambda) = |F(\lambda)|^2$ ,    (2.6)

where $F(\lambda)$ is a rational function with zeros and poles in the upper
half-plane, $(\Im(\lambda) > 0)$ . By the division algorithm the quotient of
polynomials $1/F(\lambda)$ can be expanded as

$\dfrac{1}{F(\lambda)} = E(\lambda) + M(\lambda)$    (2.7)

where $E(\lambda)$ is a polynomial of degree e and $M(\lambda)$ is a proper
rational function with poles in the upper half-plane. By proper it
is meant that the degree of the numerator is less than the degree of the
denominator. Similarly, the following expansions can be made:
'or+l


Fr=^(X+X^)F(X) 


=  E^(X)  +  M^(X)  a  =  1,  •••,  n  (2,8) 


where $E_\alpha(\lambda)$ is a polynomial of degree e-1 ; and, provided a > 0 ,
$M_\alpha(\lambda)$ is a proper rational function with poles in the upper half-plane.
Let


E(X)  =  £  e 

j=o  J 


J=o 


(2.9) 


m(t)  =  -^  /  e^'^''M(X)dX 


•»00 


1  IXi* 

^  ^  ®  MjX)dX  a  =  1,  n  . 


In section 3 the following estimates are proved efficient:


M  k^[y(t),  0  S  t  «  T;  a  >  0] 


(2.10) 


=  £  ( j)(T)  +  f  m  (T  -  t)y(t)dt]  ; 

a=i  j=o  0 


M  k2[y(t),  0  g  t  s  T;  a  >  O] 


(2.11) 


e 

=  £ 
j=o 


(-i)jejY^’5^(T)  +  f  m(T  -  t)Yj(t)dt  , 


where 




(2.12) 


”  ,lAa(T  -  t)  /  eiVy(u)du  ; 

ce=i  F(-Xq)  o 


and 

SFl^ry(t),  0  s  t  s  Tj  a  =  O]  (2.13) 

»  S  /  t'al>'aV(t)dt  . 

0=1  O 

In each case $M^T$ denotes a constant to be determined so that the
estimate is unbiased. It is evaluated by replacing y(t) with $\varphi(t)$ in
the right side of the expressions (2.10), (2.11), and (2.13). As the
notation indicates, estimates (2.10) and (2.11) are efficient in the
case a > 0 , and estimate (2.13) is efficient when all the frequencies
$\lambda_\alpha$ are real, that is, a = 0 . In order to compute the estimates (2.10)
and (2.11), $\bar{k}_1^T[a > 0]$ and $\bar{k}_2^T[a > 0]$ , it is necessary that the entire
spectral density $f(\lambda)$ be known. Estimate $\bar{k}^T[a = 0]$ , (2.13), requires
knowledge of the spectrum only at the points $-\lambda_\alpha$ , that is, $f(-\lambda_\alpha)$ ,
$\alpha = 1, \ldots, n$ .


Results concerning the least square estimate can be obtained by letting
$F(\lambda) = 1$ , for which expressions (2.10), (2.11) and (2.13) reduce to
the least square estimate for the equivalent mean value function (2.5).

Section 3

EFFICIENT ESTIMATES


Since  f(A)  is  a  positive  rational  function  it  is  clear  that  the 
condition 


(3.1) 


1  +  A 

is satisfied, and hence by Theorem 10.12b of reference [7] the process
x(t) can be represented as a moving average

$x(t) = \int \gamma(t-s)\, d\xi(s)$ ,  $-\infty < t < \infty$    (3.2)

where $\xi(s)$ is a fundamental random process with zero mean value and
$\gamma(t)$ is the Fourier inverse of $F(\lambda)$ ,


(3.3) 


Suppose there exists a function $\psi(t)$ which satisfies the conditions

$\int_{-\infty}^{t} \gamma(t-s)\, \psi(s)\, ds = \varphi(t)$ ,  $0 \le t < \infty$    (3.4)

and

$\int_{-\infty}^{T} |\psi(s)|^2\, ds < \infty$ ,  all $T < \infty$ ,    (3.5)

then the process y(t) can be represented as the moving average

$y(t) = \int_{-\infty}^{t} \gamma(t-s)\, d\eta(s)$ ,  $0 \le t < \infty$    (3.6)

where $\eta(s)$ is the process defined by

$\eta(t) = k\psi(t) + \xi(t)$ ,  $-\infty < t < \infty$ .    (3.7)

Thus there are three processes which will be considered

I    $y(t) = k\varphi(t) + x(t)$ ,  $0 \le t < \infty$

II   $y(t) = k\zeta(t) + x(t)$ ,  $-\infty < t < \infty$

III  $\eta(t) = k\psi(t) + \xi(t)$ ,  $-\infty < t < \infty$ .

The x(t) processes are second order stationary with zero mean value
and spectral density $f(\lambda)$ . The $\xi(t)$ process is a fundamental random
process. The mean value functions satisfy the conditions

$\int_{-\infty}^{t} \gamma(t-s)\, \psi(s)\, ds = \zeta(t)$ ,  $-\infty < t < \infty$    (3.8)

$\zeta(t) = \varphi(t)$ ,  $0 \le t < \infty$    (3.9)

and condition (3.5). Their transforms $\Phi(\lambda)$ , given by (2.3),

$Z(\lambda) = \int_{-\infty}^{\infty} e^{-i\lambda t} \zeta(t)\, dt$ ,    (3.10)

and

$\Psi(\lambda) = \int_{-\infty}^{\infty} e^{-i\lambda t} \psi(t)\, dt$ ,    (3.11)

are assumed to converge to proper rational functions in some non-degenerate
strip. They satisfy conditions (2.4) and from (3.8)

$Z(\lambda) = \Psi(\lambda) F(\lambda)$ .    (3.12)

The integral equation can be solved using Laplace transforms and
the Wiener-Hopf technique. (See, for example, page 313 of [14].) Taking
two-sided Laplace transforms of equation (3.4) yields the equation

$\Phi(\lambda) + H(\lambda) = \Psi(\lambda) F(\lambda)$    (3.13)

where

$H(\lambda) = \int_{-\infty}^{0} e^{-i\lambda t} \int_{-\infty}^{t} \gamma(t-s)\, \psi(s)\, ds\, dt$ .    (3.14)

Solving (3.13), the transform $\Psi(\lambda)$ must satisfy

$\Psi(\lambda) = \dfrac{\Phi(\lambda) + H(\lambda)}{F(\lambda)}$ .    (3.15)

The integral $\Phi(\lambda)$ , (2.3), converges in the region $\Im(\lambda) < -a$ . In
order that the Laplace inverse of $\Psi(\lambda)$ in (3.15) exist and satisfy
conditions (3.4) and (3.5), it is sufficient that $H(\lambda)$ be a proper
rational function with poles in the region $\Im(\lambda) < -b$ for some
$-b < -a$ and that $\Psi(\lambda)$ also be a proper rational function. Such a
function $H(\lambda)$ can be found, for example, as follows: Let $D_H(\lambda)$ be
any polynomial with zeros whose imaginary parts are less than $-b$ .
By the division algorithm the rational function $\Phi(\lambda) D_H(\lambda)$ can be
written

$\Phi(\lambda) D_H(\lambda) = Q(\lambda) + R(\lambda)$ ,    (3.16)

where $R(\lambda)$ is a proper rational function and $Q(\lambda)$ is a polynomial
of degree equal to that of $\Phi(\lambda) D_H(\lambda)$ ; that is, the degree of the
numerator of $\Phi(\lambda) D_H(\lambda)$ minus the degree of its denominator. Since $\Phi(\lambda)$
is proper, the degree of $Q(\lambda)$ is less than that of $D_H(\lambda)$ . Now define
$H(\lambda)$ by

$H(\lambda) = -\dfrac{Q(\lambda)}{D_H(\lambda)}$    (3.17)

so that

$H(\lambda) + \Phi(\lambda) = \dfrac{R(\lambda)}{D_H(\lambda)}$ .    (3.18)

By taking the degree of $D_H(\lambda)$ larger than e , the degree of $1/F(\lambda)$ ,
a proper function $\Psi(\lambda)$ is obtained. From (3.12) $Z(\lambda)$ can be defined
by

$Z(\lambda) = \Phi(\lambda) + H(\lambda)$ .    (3.19)

Thus, given a process I, the existence of processes II and III has been
established.
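
The division-algorithm step invoked for (2.7) and (3.16) splits a quotient of polynomials into a polynomial part and a proper rational remainder. A minimal Python sketch of that step follows; it assumes polynomials with rational coefficients stored as lists, highest degree first, and the name `poly_divmod` is illustrative rather than taken from the report.

```python
from fractions import Fraction


def poly_divmod(num, den):
    """Divide two polynomials given as coefficient lists, highest degree first.

    Returns (quotient, remainder) such that num = quotient*den + remainder
    with deg(remainder) < deg(den), so that num/den equals the polynomial
    `quotient` plus the proper rational function remainder/den.
    """
    num = [Fraction(c) for c in num]
    den = [Fraction(c) for c in den]
    if not den or den[0] == 0:
        raise ValueError("leading coefficient of the divisor must be non-zero")
    quotient = []
    rem = num[:]
    while len(rem) >= len(den):
        coeff = rem[0] / den[0]          # next quotient coefficient
        quotient.append(coeff)
        for i, d in enumerate(den):      # subtract coeff * den, aligned at the top
            rem[i] -= coeff * d
        rem.pop(0)                       # leading term is now zero
    return quotient, rem


if __name__ == "__main__":
    # (x^2 + 3x + 2) / (x + 3) = x + 2/(x + 3)
    print(poly_divmod([1, 3, 2], [1, 3]))
```

In the notation of (2.7), the quotient plays the role of the polynomial part and the remainder over the divisor is the proper rational part; the same split applied to the product in (3.16) yields Q and R.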

In reference [9], Mann has solved the problem of estimating the parameter
k for the process III. The minimum variance linear unbiased estimate is
given by

$M^T \hat{k}^T[\eta(t),\ -\infty < t \le T] = \int_{-\infty}^{T} \overline{\psi(t)}\, d\eta(t)$ ,    (3.20)

and

$\text{variance } \hat{k}^T[\eta(t),\ -\infty < t \le T] = 1 \Big/ \int_{-\infty}^{T} |\psi(t)|^2\, dt$ .    (3.21)

The remainder of the section will be devoted to obtaining approximate
expressions for the estimate (3.20), which are more convenient but are
still efficient for the problem. The efficiency of proposed estimates
$\bar{k}_1^T[a > 0]$ , $\bar{k}_2^T[a > 0]$ , and $\bar{k}^T[a = 0]$ given by (2.10), (2.11), and (2.13)
is proved in Theorem 1. However, before this is done, a non-rigorous
derivation of the estimates will be given by way of motivation.


First  equation  (3.6)  will  be  solved  for  dTj(s)  by  transform  methods. 

Since  the  poles  of  F(X)  are  in  the  upper  half -plane,  its  inverse  trans¬ 
form  7(t)  vanishes  for  t  <  0  .  Thus  the  process  y(t)  can  be  written 

00 

y(t)  =  /  7(t-8)dn(s)  ,  (3-22) 

«i»00 

and 

^  e“^V('t)dt  =  /  7(t-8)dTj(s)dt 


=  [/“  e”^^^^+®V(T)dT]dTi(s) 


(3.23) 


=  F(X)  e”^^®dTi(s)  . 
~  00 


Substituting the Laplace inversion formula for both $\bar\psi(t)$ and $d\eta(t)$
in formula (3.20) gives the alternative form for the estimate
wFS'^[Tl(t),  -  «<  t  S  T]  =  /^  Tr^dii(t)

«  00 


1 

(2it) 


■  -KOjic 
.-®-lc 


+«+id  ^iijt  T 
-oo+id  ^i«o 


e  ^‘^^y(u)du)  <aw 


?  dt 


> 


(3.24) 


■vdiere  c  is  in  the  region  of  convergence  of  the  integral  fCX)  in 

(j.ll),  b>c>a;d  is  taken  in  the  interval  0  <  d  <  c  vhere  the 

mean  and  variance  of  the  process  /  e  y(t)dt  and  the  integral 

/  e'^'^^^'^dt  are  all  finite.  The  notation  «(X)  indicates  that 

—  00 

complex  conjugates  of  the  coefficients  in  the  rational  function  ♦(X) 
are  taken. 


The most important change that will be made in (3.24) is truncation of
$y(t)$ for $t < 0$. Thus (3.24) becomes

0  ^  t  g  T]  /  1^  e"^‘*’V(t)dt]da,  (3.25) 

where 


f^(u))  = 


_1_ 

2n 


40)-lC 


/ 

00-lc 


^T(lai^-lX) 

i(u+X)  L  F(-X)  ■-» 


dX 


=  /^e”^*^iif(t)dt 


(3.26) 
Additional changes in the estimate (3.25) are made by expanding $\Psi^T(\omega)$
as a sum of residues at the poles of $\Phi(\lambda)$, $H(\lambda)$ and $1/F(\lambda)$ and then
picking out those terms which dominate as $T \to \infty$. When terms of lower order
are omitted, the complex integral (3.25) will in general cease to
exist in the common sense. It must be replaced by a Cesàro limit of
order $e + 1$. This limit is defined and its important properties are
stated in Theorem 2. The final estimates $\hat k_1^T[a > 0]$, $\hat k_2^T[a > 0]$ and
$\hat k^T[a = 0]$ given by (2.10), (2.11), and (2.13) are then obtained by
interpreting the resulting Cesàro limit as the Laplace inverse of
appropriate functions of $y(t)$ for $0 \le t \le T$.


The simplification of $\Psi^T(\omega)$ will be different for the two cases $a > 0$
and $a = 0$. If $a > 0$, the dominant terms of $\Psi^T(\omega)$ have magnitude
(jirg(a+u)  ^  They  are  contributed  by  the  poles  of  4>(-X),  X  =  X^,  a  =  1, 

• • • ,  n  and  are  of  the  form 


r!  f(-XJ 


(3.2?) 


Making  this  substitution  in  (3.25)  gives  the  estimate 


SFk^[y(t),  0  s  t  g  T;  a  >  O] 


400+1  d 

-1-  f 

2n  -00+id 


n 

L 


otr+l 


,i(Xa4«)T 


?  e-^“Vt)dt 
o 


(3.28)


du 
Estimates (2.10) and (2.11) are two slightly different interpretations of
this Cesàro limit as a Laplace inverse. The existence in quadratic mean
of the derivatives of $y(t)$ involved in estimates (2.10) and (2.11) can
easily be established. In order to show that (3.28) itself provides a
bona fide estimate, it would first have to be shown that the Cesàro limit
involved exists in quadratic mean.


In  the  case  a  =  0  ,  the  magnitude  of  the  dominant  terms  in  the  variance 

T 


Of  i?'  1= 


They  are  contributed  by  terms  in  }  (w)  of  the  fonn 
1T(A(;^+U)) 

(3.29) 


rM  ~  2 


n  I® 

“  CKT+l 


r+i 


In this case the dominant terms in (3.25) are the residues at the poles
$\omega = -\lambda_\alpha$, which are evaluated simply in the estimate $\hat k^T[a = 0]$ given
by (2.13).


Theorem 1. Let $y(t)$, $0 \le t \le T$, be a process of type I where $x(t)$
is a second order stationary process whose spectral density $f(\lambda)$ is a
rational function with no real zeros, and the mean value function $\varphi(t)$
is of the form (2.1). Then for $a > 0$, the estimates
$\hat k_1^T[y(t),\ 0 \le t \le T;\ a > 0]$ and $\hat k_2^T[y(t),\ 0 \le t \le T;\ a > 0]$ given by
(2.10) and (2.11) are efficient for estimating the parameter $k$. If
$a = 0$, then estimate $\hat k^T[y(t),\ 0 \le t \le T;\ a = 0]$ given by (2.13) is
efficient.
Proof: Since any linear unbiased estimate based on process I
($y(t)$, $0 \le t \le T$) is also a linear unbiased estimate for process III
($\eta(t)$, $-\infty < t \le T$), it follows from (3.21) that the inequality

$$ \operatorname{variance}\, \hat k^T[y(t),\ 0 \le t \le T] \ge 1 \Big/ \int_{-\infty}^{T} |\psi(t)|^2\, dt \tag{3.30} $$

must hold. Thus it is sufficient to show that the limit

$$ \left[\operatorname{variance}\, \hat k^T\right] \left[\int_{-\infty}^{T} |\psi(t)|^2\, dt\right] \to 1 \quad \text{as } T \to \infty \tag{3.31} $$

holds in order to establish efficiency of an estimate $\hat k^T$. The remainder
of the proof is devoted to obtaining asymptotic expressions for the
quantities $\int_{-\infty}^{T} |\psi(t)|^2\, dt$, $M^T$, and variance $M^T \hat k^T$ computed for the three
estimates considered. The notation


$$ A^T \sim B^T \tag{3.32} $$

will be used to mean

$$ \frac{A^T}{B^T} \to 1 \quad \text{as } T \to \infty. \tag{3.33} $$
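
The notation (3.32)-(3.33) can be illustrated numerically. The short Python sketch below is only an illustration under stated assumptions: the leading term is taken to be of the form $T^{2r} e^{2aT}$ (the magnitude that appears later in the proof), while the two lower-order terms, the parameter values, and the function names `dominant`, `full`, and `ratio` are hypothetical choices made for the example and are not taken from the report.

```python
import math

def dominant(T, r=2, a=0.5):
    """Leading term T^(2r) * exp(2 a T)."""
    return T ** (2 * r) * math.exp(2.0 * a * T)

def full(T, r=2, a=0.5):
    """Leading term plus two lower-order terms chosen only for illustration."""
    lower = T ** (2 * r - 1) * math.exp(2.0 * a * T) + math.exp(a * T)
    return dominant(T, r, a) + lower

def ratio(T, r=2, a=0.5):
    """A^T / B^T in the sense of (3.33), with A = full and B = dominant."""
    return full(T, r, a) / dominant(T, r, a)

# The ratio approaches 1 as T grows, so full(T) ~ dominant(T).
ratios = [ratio(T) for T in (10.0, 50.0, 200.0)]
print(ratios)
```

In this example the ratio equals $1 + 1/T$ plus an exponentially small correction, which is why only the dominant term needs to be retained in the asymptotic expressions.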


In addition to the standard Laplace transform inversion formula, the
following result due to Amerio [1] will be used:

Theorem 2. If the function $\varphi(t)$ and its first $n$ derivatives are

absolutely continuous in the interval $0 \le t \le b$, then the derivative
$\varphi^{(n)}(t)$ satisfies the inversion formula

(3.34)


R+ic 


/  (lX)"e"^»(x(l-|5)  dX  a.t.b 

’  R>ao2ll~R+iC  «/ 


where $\Phi(\lambda)$ is the transform of $\varphi(t)$


$$ \Phi(\lambda) = \int e^{-i\lambda t} \varphi(t)\, dt \tag{3.35} $$


and the line $\operatorname{Im} \lambda = c$ is in the strip of convergence of this integral for

$\Phi(\lambda)$.


In addition, if $\Phi(\lambda)$ is a proper rational function, the Cesàro limit
(3.34) can be evaluated by the contour integral on a path $C$ around the
poles of $\Phi(\lambda)$ above $ic$ if $t > 0$ and below $ic$ if $t < 0$,


lim  ~- 
R>oo  2n 


+R+ic 

/ 

-R+ic 


4>(A)(l  - 


2\n+i 


R‘ 


dA  =  .  (3 


From  the  Laplace  inversion  formula 


/l,^(t)|2dt  =7^  /  ""T  ""T 

—  00  —00  — 00— ic  — 00— ic 


(3.37) 


-too-ic  -Hxi-ic  T(iA+iw) 
^  ' 


-00-ic  -co-ic 


The complex integrals are taken in the region of analyticity which
separates the poles of $\Phi(\lambda)/F(\lambda)$ from those of $H(\lambda)$. These integrals
can be taken as contour integrals, closing them upward around the poles
of $\Phi(\lambda)/F(\lambda)$. Examination of the residues at the poles of $\Phi(\lambda)$ and
of $1/F(\lambda)$ shows that for large $T$, the dominant terms are contributed
by the poles with minimum imaginary part and among these, by the poles
with maximum order; that is, by $\lambda = -\lambda_\alpha$, $\alpha = 1, \ldots, n$. The residue of
$H(\lambda)/F(\lambda)$ at poles of $\Phi(\lambda)$ is zero, so the asymptotic expression

T 


is obtained, where the contours include only the poles $\lambda = -\lambda_\alpha$ and
$\omega = \lambda_\alpha$, $\alpha = 1, \ldots, n$. Evaluating this integral for $a > 0$ and picking
out the terms of magnitude $T^{2r} e^{2aT}$, gives the asymptotic expression


f  |♦(t)|=dt 


m^r  2a 
T  e 


n 

E 

a=i 


n 

E 

p=i 


4,  T 

cer-fl  gr+l _ 


(3.39) 


This can be summarized by the form

$$ \int_{-\infty}^{T} |\psi(t)|^2\, dt \sim T^{2r} e^{2aT} C(T) \tag{3.40} $$

where $C(T)$ is bounded from above and away from zero. This is clear
because $C(T)$ is a positive definite form in the vector


^oor+l _ 

f(-x  ) 

\  Q,/ 


a  =  1, 


n 


(3.i^l) 


whose terms have finite positive absolute values which do not depend on
$T$. For $a = 0$, terms of order $T^{2r+1}$ dominate and asymptotically
the integral (3.38) becomes
.2r+i


i! 


For $a > 0$, $M^T[a > 0]$ can be evaluated from (2.10),

fiF[a  >  0  ]  =  Z  e^^pZ^  (-i)'^e  +  /  injT-t)<p(t)dt 

■’•  Ot=i  J=o  «<J  o  “ 


From Theorem 2 this can be written


;?[ta  >  0  1  =  ^  S  e"|A 


0=1  -00-iC  LJ=0 

(C,e+1) 


4  M^(X)4(X)  dA 


=  ~  °  *(X)  Z  (A)  4  M  (A))dA 

-  oo-ic  ,  <»=1 

(C,e4l) 


From  (2.8)  this  becomes 


a  ^or+l 


dA  . 


...10)  0=1  F(.X„)(X.X„)P(X) 


This can be evaluated by a contour integral. Only terms of maximum
order $T^{r} e^{2aT}$ are retained,


ifta  >  01  i  a  VilVl-  . 

^  r'\  A  \«/  n  \  i-i/  \ 


Applying Theorem 2 to (2.11), the same expression is obtained for
$M^T[a > 0]$.

Next, variance $M^T \hat k_1^T[a > 0]$ will be computed for estimate (2.10).


variance  M  k^^la  >  0] 


n  n 
E  Z  e 
0=1  p=i 


e-i 


v(j)/ 


iT(XcrXp)  E 

Lj=o  ■  aj  ■  ■ 
(i)‘^pk^^^^(T)  +  /  m^(T-v)  x(v)dvj 


T 

+  /  mQ(T-u)x(u)duj 


(3.»^7) 


=  2  E 


0=1  p=l 


®“i  e**i  .  ^  ^k 

E  E  (-i)J(i)^  e  ^^.R(u-v) 
i=o  k=o  Q'J  P^  ^uJ 


u=v=T 


+  E^  f  m-(T-v)  ^  R(u-v) 


J*=o 


aJ  o  P 


Su" 


dv 


U:^ 


du 


v=T 


+  /  ni^(T-u )inp(T-v )  R(u-v)dudv^ 


=,  E  E  — /"  f(A) 

0^1  p=i  ^  ^  — « 

(C,e+l) 


e-x  e-i 


^  2  S  (-l)J(l)>‘e  X 
1  k=o  aJ  ?k^J  ^k 


u=v=T 


dv 


u=T 
T  T 


+  Z  (i)^  /m  du-r/  /  m  (T-u)m  (T-v)e^^^'^~^^dudv^dX 

k=0  PKo  a  Vi:g.  o  o  0!  p  J 
a*i  p=i  -»  ®


+  /^  m  (T-u)e^^'^du]  [E-(X)e^^^  +  m-(T-v)e^'^^dv]clX  . 
o  ^  ^  o  ^ 


The factor in the square bracket will be examined in detail. Using the
substitutions

(C,e+J) 

-»"-ld  (p) 

.T  X  IXu.  1  ,  ® 

/  m^(T-u)e  du  dp  , 

where $d > 0$, and (2.8), the bracket can be written


(3.W) 


+  f  m  (T-u)e^^’“du  =  ^  / 

Ct  a  i-co— 


1  .^la  .^»^(E^(p)^)^(p)) 


—Id  (ip-u) 
(C,e+1) 


dp 


4«»-id  ^ 
i  ,  ctr+l 

~Oir  J 


(3^9) 


2  ’'-00-id  p(-\T(p4>.  )F(p)(lp-iX) 
(C,e+1)  o  « 


dp 


Let  p  ,  •"»  Pk  zeros  of  F(p)  .  By  assumption  4Py>  0  .  The 

contour  In  (5.49)  will  be  closed  upward  around  the  poles  p  =  X,  -X^ 

•  •  • ,  Pj^  and  evaluated  by  residues^ 


B^(X)e^^^  +  /J  m^(T-u)e^^“du 


<!• 


Or+l 


IXT 


^-IXqT 


(X+X^)F(X) 


F(-X^)(X^+X) 


(3.50) 
Summation on $\gamma$, $\delta$, and $\nu$ is understood. The constants will
not be explicitly evaluated. Making the substitutions (2.6) and (3.50),
the integral (3.47) can now be evaluated.


variance  M  ki(a  >  0] 


n  n  JL  r"  r(MF(q 

2n  Jco  ' 


L  L 


0^1  P=1  F(-X^)F(-Ap) 


giT(Ap-Xa) 
+  — :r 


(C,e+1) 


F(-yFr^(\,+^)(y>^) 


C  C 

yfivQ!  y  _ 

V/X 


(3.51) 


+ 


glT(A+Aa) 

^-iT(AcyfA) 

(A-*A^)F(A)F(-Ap)(Ap+A) 

fTT)  (  A-P-Ap  )F(  -\)  (A^+A ) 

r  „&'  lT(A^y,) 

S'P’v'eT  , 

•y6va 

r  'Svp 

0  1 
76V  a 

dA 

F(-\)(A^+A)(P^,-A)’'' 

F(-Ap)(Ap+A)(py-A)'| 

Examination  of  the  residues  of  each  term  shows  that,  with  the  exception 

of  the  first,  all  terms  contain  one  of  the  factors  e  ^  P 
^iT(^y-^y)  ^  g“l^T(AQ;+py  )^  g^T(Ap+^)  ^  Thus  they  go  to  zero  as 

T  .  Hence,  the  asymptotic  expression. 


'^~Tr  2aT 

variance  M  kila>0J  ~e 


“  gr+l  pr+1 _ 


a=i  P=i  F(-A^F(-Ap)i(A^-Ap) 


(3.52) 
is provided by the first term of (3.51). Computation of variance
$M^T \hat k_2^T[a > 0]$ for estimate (2.11) is entirely analogous and yields the
same result.


Asymptotic expressions (3.39) and (3.52) can now be combined to
compute the efficiency of estimates (2.10) and (2.11),


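$$ \operatorname{variance}\, \hat k^T[a > 0] \int_{-\infty}^{T} |\psi(t)|^2\, dt = \frac{\left(\operatorname{variance}\, M^T \hat k^T[a > 0]\right) \left(\int_{-\infty}^{T} |\psi(t)|^2\, dt\right)}{|M^T[a > 0]|^2} \to 1. \tag{3.53} $$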


Next, $M^T[a = 0]$ will be computed from (2.13),


RTU  =  01  . 


(3.5^) 


n  ^  ,  m  r  +1  *  .  +<w-ic 

=  £  E  ^  / 


The dominant terms are for $\gamma = \alpha$ and $j = r + 1$, so that
~T  ,  .r4i  2r+i 


-  »-ic  (^+^Q ) ^ 
P 


dA 


(3.55)
For estimate (2.13) the variance can be computed as follows:


variance  M  k  la  -  OJ 

P  =  1  ''ct'"''  'P''  o  o 

r  y  ^ar4l^gr+l  1  X 


(2 


n  n 
=  £ 
a= 


d\ 


n  n  _  1  .00  , 

=  L  Z  «>  f  f(X) 

a=i  p^i  ocr-i^  pr4i  2it^co 


■  r  giT(A+Ao,)jr-j^,^_^^j  ^  r’(-l)^'^^ 

>0  (i^^+iA)•^■•"(r-j)’  (iA^4iA)^'^\ 


r  rl _ 

k=o  (iA_+iA)'^'^^(r-k)!  (iAp.+iA)^'*^ 

*-  P  P 


Evaluating each term by residues, the only terms of order $T^{2r+1}$ are
the cross product terms for $\alpha = \beta$. The contour must be deformed
around the poles and then closed upward for one cross product term and
downward  for  the  other.  Thus  only  one  of  them  contains  the  poles  of 
interest.  As  before,  the  poles  of  F(A)  and  F(A)  will  contribute 
terms  containing  the  factors  '^Ot^  where  are  the  poles  of 

F(A)  and  hence  4(t,^)  >  0  ,  so  that  terms  of  this  type  are  asymptoti¬ 
cally  negligible.  Thus  the  asymptotic  expression  becomes 
variance  M  k  la  =  OJ


^  I'^or+l  I 

~  Z  -  - 


«="  tf(-A^)]^ 


a 


j=°  (j+r+l)',i^^j+^(r-j)! 


2r+i  n  I  or+i i 


Combining expressions (3.42), (3.55), and (3.57), it is seen that
estimate (2.13) is also efficient. This concludes the proof of
Theorem 1.


(3.57) 


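Corollary 1. Under the assumptions of Theorem 1, it follows that

$$ \frac{\operatorname{variance}\, \hat k^T[y(t),\ 0 \le t \le T]}{\operatorname{variance}\, \hat k^T[y(t),\ -\infty < t \le T]} \to 1 \quad \text{as } T \to \infty. \tag{3.58} $$

Proof: In the proof of Theorem 1 it was shown that

$$ \left(\operatorname{variance}\, \hat k^T[y(t),\ 0 \le t \le T]\right) \left(\int_{-\infty}^{T} |\psi(t)|^2\, dt\right) \to 1, \tag{3.59} $$

and hence

$$ 1 \le \frac{\operatorname{variance}\, \hat k^T[y(t),\ 0 \le t \le T]}{\operatorname{variance}\, \hat k^T[y(t),\ -\infty < t \le T]} \le \frac{\operatorname{variance}\, \hat k^T[y(t),\ 0 \le t \le T]}{\operatorname{variance}\, \hat k^T[\eta(t),\ -\infty < t \le T]} = \left(\operatorname{variance}\, \hat k^T[y(t),\ 0 \le t \le T]\right) \left(\int_{-\infty}^{T} |\psi(t)|^2\, dt\right) \to 1. \tag{3.60} $$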

Corollary  2.  If  a  linear  unbiased  estimate  is  efficient  for  process  I, 
then  it  is  efficient  for  processes  II  and  III. 

Proof:  This  follows  immediately  from  Corollary  1. 
Section 4

EFFICIENCY  CLASSES 


In this section a necessary condition that a density $g$ be in the
efficiency class of $(f, \varphi)$ will be established.


Lemma 1. If there exists a linear unbiased estimate $\bar k^T$ that is
efficient for either problem I or II, then it follows that


bj.  X 

a 


(4.1)


where 

-  »  <  t  S  T]  ,  (4.2) 


=  /  e~^^^i(t)dt  , 

—  00 


(4.3)


and


b^ 


f  lf'^(k)!^dA  , 


(4.4)


T 

a 


00 


|F(X)K'^(X)l^dA 


Proof: From reference [2] page 534 it is clear that the linear estimate
$\bar k^T$ can be represented as
'k'^[y(t),  -  »  <  t  S  T]  =  ■^/V^^z'^(X)dX

L  .00 

+  -i  /  e^^''^dz(X),  -  ®  <  t  g  T j  (4.6) 

^00 

=  k  ^  KT(x)zT(x)dX  +  ^  /“  KT(x)d2(X) 

>*00  —  00 

where $dz(\lambda)$ is the spectral process of $x(t)$ with $E[|dz(\lambda)|^2] = f(\lambda)\, d\lambda$,
$K^T(\lambda)$ is given by (4.2), and $Z^T(\lambda)$ by

Z^(X)  =  /^e“^^'^^(t)dt  =  <t'^(X)  +  H(X)  .  (4.7) 

—  00 

Unbiasedness  of  the  estimate  implies  that 

~  /"  K^(X)2?’(X)dX  =  1  .  (4.8) 

«  00 

The variances of the estimates $\bar k^T$ and $\hat k^T[\eta(t)]$ are given by

variance^  "  linl^  ^  |K'’'’(X)12f(X)dX  (4_5) 

'  '  00 

=  7^  f  Ik'^(a)f(x)I  dx 

'  '  —  ee 

and 

l/variance  k^[Ti(t)]  =  /  j\if(t)  [^dt  -  rg^'g  /  |^'^(X)|2dX  (^.lO) 

—  00  '  '  —  00 

Applying the argument used to derive equation (3.25) to equation (3.8),
it is seen that

$$ Z^T(\lambda) = F(\lambda)\, \Psi^T(\lambda). \tag{4.11} $$

From Corollary 2 the estimate $\bar k^T$ is also efficient for process III, so


aV  .  I  ”.<  t  g  T]  ^  ^  asT>co. 

I  variance  k  [n(t),  -  <»  <t  ST] 


(4.12) 


The result can now be derived as follows:


rishi^ 

^  vT 
—  00  t 


X  2 


_  F(A)K^(X)'|r^'^(A)  _  F?XyK^(A)1 

J  j  “*  tji  IP  T 

—  ^  O'  J  L  ^  ^ 


(4.13) 


T r  F(A)l^(A)f'^(A)dX 


=  2  -  T  T  ^  ^ 

a  h 


Lemma 2. If there exists a linear unbiased estimate that is efficient
for either process I or II and for the two problems $[\varphi(t), f(\lambda)]$ and
$[\varphi(t), g(\lambda)]$; that is, if $g$ is in the efficiency class of $(f, \varphi)$, then


~  fp(A)G(A)  aj  f^(A)  F(A)  ^ 

f  m  “  ttT  m  ^ 

“hi  at  bl 

JJ  r  li 


dA  >  0  as  T  w  . 


(4.14) 


Proof:  From  an  elementary  inequality,  it  is  clear  that 
«,  JJU)G(A)  aj  fJ(A)F(X)  ^

f  **  ^  fn 

-“  4  ^F 


^2  /  ^G(A)  -\- 

-“  L"f 


X)F(X)li®.  K^(X)G(X: 


Since $G(\lambda)$ and $F(\lambda)$ are rational functions with no real poles, they
are bounded,

$$ |G(\lambda)| \le M, \qquad |F(\lambda)| \le M, $$

for $\lambda$ real and hence


CO  $p(X)G(X)  a^  -S -  ^ 

;  - G  $q(X)F(X)  dX 


From (3.40) and (3.42)

$$ (b_F^T)^2 \sim T^{2r} e^{2aT} C_F(T) \quad \text{or} \quad T^{2r+1} C_F, $$

where $C(T)$ is bounded from above and away from zero. From efficiency


m  m 

a„b„  >1  so  that 
P  F 


T  T-(==^+i) 

*P  ~  0,(T)  “'■-0 - 
Thus the expression


is bounded and the result follows from Lemma 1.


(4.20)


Theorem 3. If there exists a linear unbiased estimate that is efficient
for either process I or II and for the two problems $[\varphi(t), f(\lambda)]$ and
$[\varphi(t), g(\lambda)]$; that is, if $g$ is in the efficiency class of $(f, \varphi)$, then (i) if $a > 0$, it
follows that


♦  0 


and  (ii)  if  a  =  0  , 


as  T  >•  »  y 


(^.21) 


a  =  1,  •••,  n  ,  (4.22) 


where  Cj  and  are  constants. 


Proof:  (i)  For  any  sequence  there  exists  a  subsequence  Tj,  and 

complex  numbers  9^^  such  that 


a 


l.OL 


as  y 


a  =  1, 


n 


(^^.23) 


For  this  subsequence,  it  follows  from  (5*27)^  and  (4.19)  that 
.yTjX^sT,


!i' 


G(X)  ^  cg-t-l’ 


0 


^V'«/  y  IMfJ. 

^Cp(Tj)  «=^  (X-*^^)Fr^ 


(1^.21^) 


VVV  n  Vu-i 

""c  Tt  ^  ^  - ^P\"V 

VTj)  0=^(X+X^)®^‘Vj 


0(X)  J  Vfl  J  V+l4 

>f^  0=1  (X-*Xq)  F(-Xj^)  Cq  0=1  G(-X^)(X^4 


From Lemma 2 this limit must be identically zero.


(ii) For $a = 0$, the integral (4.14) will be evaluated. Making the
substitution $\Psi^T(\lambda) = Z^T(\lambda)/F(\lambda)$ for $\Psi_F^T(\lambda)$ and $\Psi_G^T(\lambda)$, this becomes


iF(X)G(X) 


aj  fJ(X)F(X) 


2 

dX 


G^F  F 


T,  T  G 


dX 


F 


(‘^.25) 


It is clear from the discussion in section 3 that the extension of $\varphi(t)$
to $\zeta(t)$ depends only on the degree of $F(\lambda)$ and hence the same extension
$Z^T(\lambda)$ will do for both $F(\lambda)$ and $G(\lambda)$. The function $Z^T(\lambda)$ will
be evaluated by closing the contour in the integral
+«-ic  (it^i)v)T


(1^.26) 


upward around the poles of $\Phi(\omega)$ and $\omega = \lambda$ and collecting the residues


Z^(X)  =  «(X)  +  H(X)  +  £  £  -21J - -S-T-r^ - 

y=i  >1  (j«i)i  (<^X) 


u)  =  -Xy 


r  +1 
m  y 


=  ^jL75%7r 


(‘>.27) 


Since $(b_F^T)^2 \sim T^{2r+1} C_F$, terms of magnitude $T^{2r+1}$ must be found in
(4.25). Terms containing the factor $H(\lambda)$ will not be of this magnitude.
Thus making the substitution (4.27) in (4.25)


/ 

•*00 


)g(^  ^  ®'g  ~t 


Vg 


dX 


T  mm  ’V''^  _  00  I  ri/^ 


_  K  1  -iXT  (iT).V^^y^  (-1)1 

aJb^G(X)'  K?v+X^)^  ^=°  -I 


(X+X^)^ 


^0+1  i^o  ^ 


dX 


(4.28) 
Terms of maximum order are obtained when $\gamma = \delta = \alpha$ and $i = j = r + 1$
for the cross product terms. The contour may be deformed downward around
the poles at $\lambda = -\lambda_\alpha$, $\alpha = 1, \ldots, n$.

T  T 

a(M  _  V^fO) 
FW-^TotXt 


Again  the  poles  of 


will  not  contribute  terms  of  maximum  order. 


Evaluating  by  residues, 

de 

/ 


ip(A)F(A)  dA 


T.T 


2ni 


CpT 


n 

Z 

0=1' 


1  A  1  2 

a(-x„) 

Cp  H-\) 

rar+i| 

11=0  £l{2r+l-Z)i 


2n 


n 

Z 


(2r+l)(rJ  0=1 


oor+i 


°(-V 

■  ‘'o 


From Lemma 2 this limit must be zero, and since $\Phi_{\alpha, r+1} \ne 0$, it follows
that (4.22) is satisfied.


In the case $a = 0$, it is clear that the estimate $\hat k^T[y(t),\ 0 \le t \le T;\ a = 0]$
given by (2.13) is efficient for all densities $g(\lambda)$ which
satisfy condition (ii) of the theorem. Thus condition (ii) is a
necessary and sufficient condition that $g(\lambda)$ belong to the efficiency class of $(f, \varphi)$;
the estimate (2.13) is economical of information concerning the spectrum
$f(\lambda)$ in the sense that there exists no linear unbiased estimate
which is efficient for a wider class of spectra.

In the case $a > 0$, the theorem states that if there exists an estimate
that is efficient for $[\varphi(t), f(\lambda)]$ and $[\varphi(t), g(\lambda)]$ on a sequence $T_j$
for which condition (4.23) holds, then the spectra $f(\lambda)$ and $g(\lambda)$ are
related by

C  2  ^ocr+l^a _ 1  y  ^ar-fl4t  /,  „ 

^0(=i  g(-xJ(X+X^)  ^^Q!=aF(-X^)(X+X^) 

For $n = 1$ or $2$ it can be shown that this implies that $g(\lambda) = C f(\lambda)$
and hence that estimates $\hat k_1^T[y(t),\ 0 \le t \le T;\ a > 0]$ and
$\hat k_2^T[y(t),\ 0 \le t \le T;\ a > 0]$ given by (2.10) and (2.11) are economical in
the same sense stated above for $\hat k^T[y(t),\ 0 \le t \le T;\ a = 0]$. However,
the economy of estimates (2.10) and (2.11) is of little practical interest
since it is very unlikely that the function


1  ^  ^  '^ctr4l4t 
0=1  f(-x^T(x+x^) 


(4.31)


would in practice be known if the function $F(\lambda)$ were unknown. This is
not the case for the estimate $\hat k^T[y(t),\ 0 \le t \le T;\ a = 0]$, since here a
real reduction is made in the information required concerning $f(\lambda)$.
For instance, if $n = 1$, $\hat k^T[a = 0]$ becomes essentially the least square
estimate and no knowledge of $f(\lambda)$ is required. For larger $n$, if the
spectrum is to be estimated from an independent experiment, then estimates
need only be made at the frequencies of interest $-\lambda_\alpha$, $\alpha = 1, \ldots, n$,
considerably reducing the work.
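
To make the remark about the least square estimate concrete, the following Python sketch computes a discretized least square estimate of the regression parameter. It is a hedged illustration and not the report's procedure: it assumes that process I has the form $y(t) = k\varphi(t) + x(t)$, that the least square estimate is the ratio $\int \bar\varphi\, y\, dt \big/ \int |\varphi|^2\, dt$, and it uses independent Gaussian noise as a stand-in for the stationary process $x(t)$. The choice $\varphi(t) = e^{2it}$, the noise level, and all names are hypothetical.

```python
import math
import random

def least_squares_k(phi, y, dt):
    """Discrete form of integral(conj(phi) * y) / integral(|phi|^2) on a uniform grid."""
    num = sum(p.conjugate() * v for p, v in zip(phi, y)) * dt
    den = sum(abs(p) ** 2 for p in phi) * dt
    return num / den

T, n = 10.0, 2000
dt = T / n
t = [i * dt for i in range(n)]

# Assumed mean value function phi(t) = exp(2 i t); purely illustrative.
phi = [complex(math.cos(2.0 * s), math.sin(2.0 * s)) for s in t]

k_true = 1.5
rng = random.Random(0)                       # fixed seed for reproducibility
noise = [rng.gauss(0.0, 0.3) for _ in t]     # stand-in for x(t), an assumption

y_noisy = [k_true * p + e for p, e in zip(phi, noise)]
y_clean = [k_true * p for p in phi]

k_hat = least_squares_k(phi, y_noisy, dt)    # close to k_true
k_exact = least_squares_k(phi, y_clean, dt)  # recovers k_true up to rounding
print(k_hat, k_exact)
```

Note that the estimate uses only $\varphi(t)$ and the observations; no spectral information about the noise enters, which is the sense in which no knowledge of $f(\lambda)$ is required.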
For the least square estimate given by (1.4), the function $K^T(\lambda)$
becomes


k'^(a) 


f  |cp{t)12dt  f  |(p(t)|2dt 


(4.32)


For the pseudo-spectral density $g(\lambda) = 1$,

$$ \Psi_G^T(\lambda) = Z^T(\lambda) = \Phi^T(\lambda) + H(\lambda), \tag{4.33} $$


and 


K^(A) 


fJ(A)  -  H(Ay 
f  |cp(t)|2dt  ■ 


(4.34)


If this expression is substituted in (4.1) and terms involving $H(\lambda)$
are neglected, the expression (4.25) is obtained and the following
results concerning the least square estimate can be deduced: If the
least square estimate is efficient for $f(\lambda)$, then if $a = 0$

=  ^2  a  =  1,  "•*,  n  ,  (4.35 


and  if  a  >  0 


1 

f(aT 


$  -.e 

or+l 


LT^(Aa) 


n 

Z  _  _ 

a=i  f(^(A+A^) 


n 

Z 

a=i 


ar+l 


LT^(Aa) 


(A+A  ) 
a' 


>0 


(4.36)


as $T \to \infty$.
A  MODEL  OF  TURBULENCE


DISPLAYING  AUTONOMOUS  OSCILLATIONS 


R.J.  Dickson 
the A. M. S., Vol. 5, No. 7, Dec 1958, pp. 822-823.


LOCKHEED  AIRCRAFT  CORPORATION 


MISSILES  and  SPACE  DIVISION 


ABSTRACT 


This  report  presents  the  results  of  the  author's  attempt  to  find  and  treat 
simplified  examples  of  problems  governed  by  nonlinear  space-time  systems. 
The work was carried out under the General Research Program of Lockheed
Missiles  and  Space  Division. 

The  introduction  sketches  the  general  setting  of  a  "Model  of  Turbulence"  in  the 
sense  of  J.  M.  Burgers,  and  directs  attention  to  the  character  of  the  particular 
problem  considered  here. 

Section  2  introduces  a  special  class  of  nonlinear  ordinary  differential  equations 
and  explains  its  relationship  to  a  partial  differential  system  studied  by  Burgers. 

Section 3 takes the first step in a detailed study of this class of equations.
Specifically, it is proved that a certain fourth-order autonomous system of
ordinary equations possesses an unstable equilibrium solution and a stable
periodic solution; and that with increasing time all other solutions tend
asymptotically to one of these two possibilities.

Section 1
INTRODUCTION 


Hydrodynamic  turbulence  is  a  variety  of  parasitic  oscillation.  Its  qualitative 
features  are  easily  observed  in  nature  and  in  the  laboratory,  yet  their  mathematical 
description  has  remained  for  many  years  at  a  primitive  level.  Almost  no  progress 
has  been  made  toward  proving  -  or  disproving  -  that  the  dynamical  (Navier-Stokes) 
equations  of  fluid  mechanics  possess  solutions  consistent  with  observed  turbulent 
flows. 


To  gain  insight  into  the  mathematical  structure  of  turbulence,  J.  M.  Burgers 
initiated  in  1939  the  study  of  "models  of  turbulence"  (Refs.  1  and  2).  Typically, 
one of Burgers' models is a mathematical problem which preserves selected features
of the hydrodynamical problem yet remains simple enough to admit analytical
study;  it  does  not  correspond  to  any  real  physical  problem,  but  is  studied  formally 
as  a  mathematical  problem  for  its  suggestiveness  and  for  possible  insight  into  the 
nature  of  its  more  complex  parent. 


The  relation  of  Burgers'  work  to  the  existing  literature  on  the  subject  of  "isotropic 
turbulence"  may  be  clarified  by  the  following  quotation  (Ref.  1,  p.  47): 

It should be observed that the theories developed by Taylor,
von Kármán and others are concerned mainly with a different
problem, viz., the character of the turbulence found, e.g.,
in an air current in the wake of a grid, and its gradual decay.

These theories do not consider the side of the problem to
which attention has been given here, viz., the development of
a dissipative secondary phenomenon, which grows by detracting
energy from a given primary phenomenon and in this process
gains such an intensity, that finally a balance is obtained between
the energy detracted and the energy dissipated. This latter
subject is the one which has been traced here through stages
of successive complexity . . . and it will be clear that it has a
generality much wider than the field of hydrodynamics. The
problem can be compared to that of the development of relaxational
oscillations, investigated by van der Pol. The classical
example treated by van der Pol, however, refers to a system
with a single variable, whereas the present case is characterized
by the appearance of a dissipative secondary phenomenon
embracing an infinite number of degrees of freedom . . .


Referring  to  phase  flows  generated  by  the  Navier-Stokes  equations,  E.  Hopf  (Ref.  3) 
described  the  motivation  for  the  study  of  models  when  he  wrote: 

How do the solutions which represent the observed turbulent
motions fit into the phase picture? The great mathematical difficulties
of these important problems are well known and at
present the way to a successful attack on them seems hopelessly
barred. There is no doubt, however, that many characteristic
features of the hydrodynamical phase flow occur in a much
larger class of similar problems governed by nonlinear space-time
systems. In order to gain insight into the nature of hydrodynamical
phase flows we are, at present, forced to find and to
treat simplified examples within that class. The study of such
models has been originated by J. M. Burgers in a well known
memoir . . .


This paper presents results of the author's attempt "to find and to treat simplified
examples within that class." More specifically, the attempt was to find a model
which compromised the shortcomings of two of Burgers' models. The first of these,
which will be called here the "preliminary model," is the following differential
boundary value problem for a (real valued) function $u(x,t)$ defined for positive
"time," $t > 0$, on the one-dimensional "space," $0 \le x \le 1$:

$$ u_t = \nu u_{xx} + u - 2 u u_x \tag{1.1} $$

$$ u(0,t) = u(1,t) = 0, \quad t > 0 \tag{1.2} $$

The subscripts here denote partial derivatives, while $\nu$ is a positive constant which
Burgers calls the viscosity. In Burgers' "interpretation," the function $u$ is thought
of as the turbulent perturbation of an Eulerian velocity field; thus, the obvious solution
$u(x,t) = 0$ is referred to as the "laminar" solution. If Eq. (1.1) is multiplied
by $u$ and integrated over $0 \le x \le 1$, the contribution of the nonlinear term
vanishes because of Eq. (1.2). The contribution of the viscosity term can be integrated
by parts, yielding formally the relation

$$ \frac{1}{2} \frac{d}{dt} \int_0^1 u^2\, dx = -\nu \int_0^1 u_x^2\, dx + \int_0^1 u^2\, dx \tag{1.3} $$

which Burgers calls the "energy equation" because it reveals that the "energy of
turbulence," $\int_0^1 u^2\, dx$, tends to be diminished (or dissipated) by the viscosity
term and increased by the "driving" term $u$ of Eq. (1.1), which plays a role analogous
to the pressure gradient term of the Navier-Stokes equations. (The question
of which of these influences predominates is discussed in Ref. 4, where it is shown,
in particular, that Eq. (1.1) belongs to a class of equations for which a modified
maximum principle holds.)
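
The two facts used in deriving the energy equation can be checked numerically for a sample function. The Python sketch below is only an illustration: the trial function `u` (a sum of two sine modes vanishing at $x = 0$ and $x = 1$), the quadrature routine, and all names are assumptions introduced here. It verifies that the nonlinear term $2uu_x$ contributes nothing after multiplication by $u$ and integration over $0 \le x \le 1$, and that $\int_0^1 u\,u_{xx}\,dx = -\int_0^1 u_x^2\,dx$.

```python
import math

def trapezoid(f, n=2000):
    """Composite trapezoid rule on [0, 1] with n subintervals."""
    h = 1.0 / n
    total = 0.5 * (f(0.0) + f(1.0))
    total += sum(f(i * h) for i in range(1, n))
    return total * h

# Hypothetical trial function satisfying u(0) = u(1) = 0, with exact derivatives.
def u(x):
    return math.sin(math.pi * x) + 0.3 * math.sin(2.0 * math.pi * x)

def ux(x):
    return math.pi * math.cos(math.pi * x) + 0.6 * math.pi * math.cos(2.0 * math.pi * x)

def uxx(x):
    return -math.pi ** 2 * math.sin(math.pi * x) - 1.2 * math.pi ** 2 * math.sin(2.0 * math.pi * x)

# Contribution of the nonlinear term: integral of u * (2 u u_x) = (2/3) [u^3] from 0 to 1.
nonlinear = trapezoid(lambda x: u(x) * 2.0 * u(x) * ux(x))

# Viscosity term before and after integration by parts.
visc_direct = trapezoid(lambda x: u(x) * uxx(x))
visc_parts = -trapezoid(lambda x: ux(x) ** 2)

print(nonlinear, visc_direct, visc_parts)
```

For this trial function the dissipation integral $\int_0^1 u_x^2\,dx$ equals $0.68\,\pi^2$, while the nonlinear contribution vanishes to rounding error because $u^3$ is zero at both ends of the interval.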

If $\nu$ is sufficiently large, it can be proved that the viscous dissipation is predominant;
i.e., all solutions tend to the stationary, laminar solution as $t \to \infty$. As $\nu$
is decreased, however, a positive critical value is reached at which this total stability
of the laminar solution is lost. On the basis of hydrodynamical evidence, it
would be reasonable to conjecture that this loss of stability marks the entry into a
"turbulent regime" in which a time-dependent periodic solution appears and that
further decrease of $\nu$ will lead to successive branchings at which more and more
complicated oscillatory solutions become possible. (For a detailed conjecture on
the nature of the branching process and illustrative examples, see Ref. 3.) So far
as is known, however, the model under consideration is not sufficiently sophisticated
to display such behavior. A branching process does indeed occur, but the
"turbulent" solutions which appear are stationary, i.e., time independent. (For
details see Refs. 1 and 2.)
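
A rough feel for the critical value and the successive branchings can be obtained from the linearization of Eq. (1.1) about the laminar solution. The Python sketch below rests on assumptions that are not spelled out in the text: the nonlinear term is dropped, leaving $u_t = \nu u_{xx} + u$, and the solution is expanded in the modes $\sin(n\pi x)$ compatible with Eq. (1.2), so that mode $n$ grows at the rate $1 - \nu n^2\pi^2$. Under these assumptions the first mode becomes unstable at $\nu = 1/\pi^2$. The function names are hypothetical.

```python
import math

def growth_rate(nu, n):
    """Linear growth rate of the mode sin(n*pi*x) for u_t = nu*u_xx + u."""
    return 1.0 - nu * (n * math.pi) ** 2

def unstable_modes(nu):
    """Mode numbers n >= 1 whose linear growth rate is positive."""
    n_max = int(math.floor(1.0 / (math.pi * math.sqrt(nu))))
    return [n for n in range(1, n_max + 1) if growth_rate(nu, n) > 0.0]

# Threshold of the first mode under the linearization assumed above.
NU_CRITICAL = 1.0 / math.pi ** 2

for nu in (0.2, 0.05, 0.01):
    print(nu, unstable_modes(nu))
```

As $\nu$ decreases, more modes acquire positive growth rates one after another, which is consistent with the picture of successive branchings; the linear analysis says nothing about whether the resulting solutions are stationary or time dependent.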

In constructing a more complicated turbulence model, Burgers ingeniously excluded
the possibility of stationary solutions other than the laminar one. Specifically, he
proposed the system


=  VV  +  V  ~  W  +  (w  -  V  ) 
t  XX  '  'x 

(1.4) 

w^=iw  +V+W  +  (2vw) 
t  XX  '  'x 

for two functions v(x,t), w(x,t) on $0 \le x \le 1$, $t \ge 0$ subject to

$$ v(0,t) = v(1,t) = w(0,t) = w(1,t) = 0, \quad t \ge 0 \tag{1.5} $$

It is convenient to combine Eqs. (1.4) into one equation for the complex valued
function

$$ u(x,t) = v(x,t) + i\,w(x,t) $$

Burgers' model is then

$$ u_t = \nu u_{xx} + (1 + i)\,u - 2\,\bar u\,\bar u_x, \quad \bar u = v - i w \tag{1.6} $$
$$ u(0,t) = u(1,t) = 0 \tag{1.7} $$


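Multiplication of Eq. (1.6) by $\bar u$, integration over $0 \le x \le 1$, and
integration by parts yield

$$ \int_0^1 \bar u\,u_t\,dx = -\nu \int_0^1 |u_x|^2\,dx + (1 + i) \int_0^1 |u|^2\,dx \tag{1.8} $$

(The nonlinear term contributes $-2\int_0^1 \bar u^2 \bar u_x\,dx = -\tfrac{2}{3}[\bar u^3]_0^1 = 0$
by Eq. (1.7).)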


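The real and imaginary parts of Eq. (1.8) are

$$ \frac{1}{2}\frac{d}{dt} \int_0^1 |u|^2\,dx = -\nu \int_0^1 |u_x|^2\,dx + \int_0^1 |u|^2\,dx \tag{1.9} $$

and

$$ \frac{1}{2i} \int_0^1 (\bar u\,u_t - u\,\bar u_t)\,dx = \int_0^1 |u|^2\,dx \tag{1.10} $$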


Equation (1.9) is the energy equation, exactly analogous to Eq. (1.3); Eq. (1.10)
shows that u = 0 is the only solution independent of the time. If the model has
any turbulent solutions at all, they are not stationary.

But are there any turbulent solutions? As with the preliminary model, it is found
that the laminar solution loses stability below a positive critical value of viscosity.
For subcritical $\nu$, there is presumably a turbulent regime. Although Burgers
presented heuristic arguments favoring the existence and boundedness of solutions
for all time, essentially nothing is known with certainty about the behavior of
solutions in the turbulent range of $\nu$.

Whereas  the  preliminary  model  was  too  simple  to  display  time-dependent  turbulent 
solutions,  this  model  is  not  simple  enough  to  permit  analysis.  It  was  an  attempt  to 
formulate  models  of  intermediate  complexity  which  generated  the  results  of  this 
paper.

Section 2

A  MODIFIED  MODEL 


INTRODUCTION 

Numerical approximations to the solution of a partial differential boundary value
problem are ordinarily obtained with digital or electrical analog computers. Such
machines will accept the problem only in a modified or analogous form. For a
digital computer, the problem must be replaced by one of its approximate
"discretized" versions requiring only algebraic operations. For an analog machine,
the discretization need not be complete; in a space-time system, for example, it is
sufficient to "lump" space alone, leaving time as a continuous variable.

In either case, metamorphosis of the problem requires that functions of a continuous
variable be replaced by functions defined on a finite set of points, and that
derivatives be replaced by algebraic differences. There is freedom of choice in the
number "n" of points, i.e., the "fineness" of the discretization, and in the
particular difference scheme employed. No criteria for making these choices are
dictated by the original differential problem. Mathematical problems of considerable
depth arise from the questions of how to make these choices in an optimal way and
how to assess the goodness of the resulting approximation.

The more or less arbitrary choice of a particular difference scheme is often
justified by the observation that the distinction between formally admissible schemes
vanishes as $n \to \infty$, i.e., in the limit of refinement of the discretization. It may
occur, however, that for small values of n certain special schemes are most
effective in preserving some kind of analogy between the exact and approximate
problems. In discretizing the "space" $0 \le x \le 1$ involved in Burgers' model Eq. (1.6),
the most obvious and straightforward choice is poor since it does not yield for each
n an exact analog of Eq. (1.8).

Specifically, let $u_k(t)$ denote an approximation to $u(k/n, t)$, $k = 1, \ldots, n-1$,
and set $u_0(t) = u(0,t) = 0$, $u_n(t) = u(1,t) = 0$. Suppose the $u_k$ to be solutions
of the following system of ordinary differential equations obtained from Eq. (1.6) by
replacing spatial derivatives by central divided differences:

$$ \dot u_k = \nu n^2 (u_{k+1} - 2u_k + u_{k-1}) + (1 + i)\,u_k - n\,\bar u_k (\bar u_{k+1} - \bar u_{k-1}), \quad k = 1, \ldots, n-1 $$

Multiplying by $\bar u_k$ and summing on k, it is easily found that the exact analog of
Eq. (1.8) does not hold because the contribution from the nonlinear terms does
not vanish identically, i.e.,

$$ \sum_{k=1}^{n-1} \bar u_k^2 (\bar u_{k+1} - \bar u_{k-1}) \not\equiv 0 $$
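
The failure of the central-difference scheme can be seen on a small example. The following is a minimal sketch, assuming the nonlinear sum has the form $\sum_k \bar u_k^2(\bar u_{k+1} - \bar u_{k-1})$ obtained by multiplying the central-difference nonlinear term by $\bar u_k$; the function name and the sample values are illustrative choices, not taken from the report.

```python
def nonlinear_sum_central(u):
    """Sum over k = 1..n-1 of conj(u_k)^2 * (conj(u_{k+1}) - conj(u_{k-1})).

    `u` holds u_0, ..., u_n including the boundary values u_0 = u_n = 0.
    """
    c = [complex(z).conjugate() for z in u]
    return sum(c[k] * c[k] * (c[k + 1] - c[k - 1]) for k in range(1, len(c) - 1))


# Hypothetical sample with n = 3: u_0 = 0, u_1 = 1, u_2 = 2 + i, u_3 = 0.
sample = [0, 1, 2 + 1j, 0]
defect = nonlinear_sum_central(sample)  # nonzero, so no exact analog of Eq. (1.8)
```

For this sample the two summands are $2 - i$ and $-(2 - i)^2 = -3 + 4i$, so the sum does not cancel.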


In the attempt to find an approximating system without this imperfection, it has been
found sufficient to approximate the function $\bar u$ in the nonlinear term
$\bar u\,\bar u_x$ by the average $\tfrac{1}{3}(\bar u_{k-1} + \bar u_k + \bar u_{k+1})$.
The properties of the resulting system will be summarized as Theorem 2.1.

THEOREM 2.1:

If

$$ \nu > 0, \quad n \ge 2, \quad u_0 = u_n = 0, \tag{2.1} $$

the system

$$ \dot u_k = \nu n^2 (u_{k+1} - 2u_k + u_{k-1}) + (1 + i)\,u_k - \frac{n}{3} (\bar u_{k-1} + \bar u_k + \bar u_{k+1})(\bar u_{k+1} - \bar u_{k-1}) \tag{2.2} $$

for the (complex valued) functions $u_k(t)$, k = 1, 2, ..., n-1 has the following
properties:

• Every solution satisfies the relations

$$ \frac{1}{2}\frac{d}{dt} \sum_{k=1}^{n-1} |u_k|^2 = -\nu n^2 \sum_{k=1}^{n} |u_k - u_{k-1}|^2 + \sum_{k=1}^{n-1} |u_k|^2 \tag{2.3} $$

$$ \sum_{k=1}^{n-1} |u_k|^2\,\dot\theta_k = \sum_{k=1}^{n-1} |u_k|^2 \tag{2.4} $$

where

$$ u_k = |u_k| \exp(i\theta_k) $$

• The solution taking arbitrarily assigned initial values exists uniquely and
is bounded over any finite time interval $0 \le t \le T < \infty$.

• $u_k(t) = 0$, k = 1, 2, ..., n-1 is the only time-independent, i.e.,
equilibrium, solution. Define $\nu_{mn} = [2n \sin(m\pi/2n)]^{-2}$, m = 1, ..., n-1.
(As $n \to \infty$, $\nu_{1n} \to \pi^{-2}$.) If $\nu > \nu_{1n}$, every solution approaches
the zero solution as $t \to \infty$. If $0 < \nu < \nu_{n-1,n}$, every nonzero solution
approaches infinity, i.e.,

$$ \sum_{k=1}^{n-1} |u_k|^2 \to \infty \quad \text{as } t \to \infty. $$


Proof.

1. After multiplying Eq. (2.2) by $\bar u_k$ and summing on k, Eqs. (2.3) and
(2.4) follow from the

Lemma: If $u_k$, k = 0, 1, ..., n are complex numbers with $u_0 = u_n = 0$, then

$$ \sum_{k=1}^{n-1} \bar u_k (u_{k+1} - 2u_k + u_{k-1}) = -\sum_{k=1}^{n} |u_k - u_{k-1}|^2 \tag{2.5} $$

and

$$ \sum_{k=1}^{n-1} u_k (u_{k-1} + u_k + u_{k+1})(u_{k+1} - u_{k-1}) = 0 \tag{2.6} $$

Equation (2.5) follows at once from the Abel partial summation formula, quite
analogously to the integration by parts used in obtaining Eq. (1.8). Equation
(2.6) follows from the observation that the k-th summand can be written as
$\alpha_k - \alpha_{k-1} + \beta_k - \beta_{k-1}$, where $\alpha_k = u_k^2 u_{k+1}$ and
$\beta_k = u_k u_{k+1}^2$, so that the sum telescopes to
$\alpha_{n-1} - \alpha_0 + \beta_{n-1} - \beta_0 = 0$.

2. It is apparent from classical theorems that a unique solution to the initial value
problem exists for some interval $0 \le t < T$. That T may be assumed arbitrarily
large follows from the classical extension theorem once it is shown that

$$ c(t) = \sum_{k=1}^{n-1} |u_k|^2 $$

is bounded over any finite interval. To prove this, observe that the right member
of Eq. (2.3) is a hermitian form (call it H) in the variables $u_1, \ldots, u_{n-1}$.
In the sequel, it will appear that the eigenvalues of this form are given explicitly,
in decreasing order, by $\lambda_k = 1 - 4\nu n^2 \sin^2(k\pi/2n)$, k = 1, 2, ..., n-1. By a
well-known characterization of the least and greatest eigenvalues,

$$ \lambda_{n-1}\,c \le H \le \lambda_1\,c $$

yielding, from Eq. (2.3), the differential inequality

$$ \lambda_{n-1}\,c \le \frac{1}{2}\frac{dc}{dt} \le \lambda_1\,c $$

Integration gives

$$ c(0)\,e^{2\lambda_{n-1} t} \le c(t) \le c(0)\,e^{2\lambda_1 t} \tag{2.7} $$

which exhibits the desired bound.

3. The uniqueness of the equilibrium solution is obvious from Eq. (2.4). The
remaining assertions are implied by Eq. (2.7) upon noting that $\nu > \nu_{1n}$ implies
$\lambda_1 < 0$ and $\nu < \nu_{n-1,n}$ implies $\lambda_{n-1} > 0$.
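
The relations (2.3) and (2.4) can be spot-checked numerically. The sketch below assumes the reading of Eq. (2.2) with the three-point average in the nonlinear term; the function names `rhs_22` and `check_relations` are hypothetical. It evaluates $\sum_k \bar u_k \dot u_k$ directly and compares it with the value predicted by the Lemma, whose real part is the right member of Eq. (2.3) and whose imaginary part is the right member of Eq. (2.4).

```python
def rhs_22(u, nu):
    """Right-hand side of system (2.2) for the interior values u_1..u_{n-1}."""
    n = len(u) + 1
    full = [0j] + [complex(z) for z in u] + [0j]  # boundary values u_0 = u_n = 0
    out = []
    for k in range(1, n):
        linear = nu * n * n * (full[k + 1] - 2 * full[k] + full[k - 1]) + (1 + 1j) * full[k]
        three_sum = (full[k - 1] + full[k] + full[k + 1]).conjugate()  # 3 * average
        diff = (full[k + 1] - full[k - 1]).conjugate()
        out.append(linear - (n / 3) * three_sum * diff)
    return out


def check_relations(u, nu):
    """Return (sum conj(u_k) * du_k/dt, value predicted by Eqs. (2.3) and (2.4))."""
    n = len(u) + 1
    full = [0j] + [complex(z) for z in u] + [0j]
    du = rhs_22(u, nu)
    lhs = sum(complex(uk).conjugate() * duk for uk, duk in zip(u, du))
    energy = sum(abs(uk) ** 2 for uk in u)
    dissipation = nu * n * n * sum(abs(full[k] - full[k - 1]) ** 2 for k in range(1, n + 1))
    return lhs, complex(energy - dissipation, energy)
```

Calling `check_relations` on any list of interior values should return two numbers that agree to rounding error; the nonlinear term drops out of the sum because of Eq. (2.6).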

As in the models considered by Burgers, Eq. (2.2) exhibits a "laminar" regime,
$\nu > \nu_{1n}$, in which all solutions tend to a totally stable unique laminar solution.
For fixed n and small $\nu$, however, the model encounters a deficiency inherent in
finite discretization schemes. Of necessity, a function defined on a finite mesh
cannot adequately approximate spatial fluctuations on a scale smaller than the mesh
spacing; in a model of turbulence like Burgers', however, fluctuations on a finer and
finer spatial scale are to be expected with diminishing viscosity. It is, therefore,
not surprising that solutions of a model like Eq. (2.2) are found to "blow up" if
$\nu$ is too small. As a consequence, it is apparent that Eq. (2.2) can display
"interesting" behavior only in some range $\nu' < \nu < \nu_{1n}$, and then Theorem 2.1
gives $\nu_{n-1,n}$ as a lower bound on $\nu'$.

In the case n = 2, the model is essentially trivial, since Eq. (2.2) reduces to a
single linear equation and the interval of interest degenerates to a point, i.e.,
$\nu' = \nu_{12}$. It is suggestive, however, that for $\nu = \nu_{12}$, all solutions are periodic.
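
The thresholds $\nu_{mn}$ and the eigenvalues $\lambda_k$ are simple to tabulate. The sketch below uses hypothetical function names and only the two formulas quoted in Theorem 2.1 and its proof. For n = 2, Eq. (2.2) reduces (on the reading used here) to $\dot u_1 = (1 - 8\nu + i)\,u_1$, so at $\nu = \nu_{12} = 1/8$ it becomes $\dot u_1 = i\,u_1$, whose solutions have period $2\pi$.

```python
import math


def nu_threshold(m, n):
    """nu_{mn} = [2 n sin(m pi / (2 n))]^(-2), as defined in Theorem 2.1."""
    return 1.0 / (2 * n * math.sin(m * math.pi / (2 * n))) ** 2


def growth_rate(k, n, nu):
    """lambda_k = 1 - 4 nu n^2 sin^2(k pi / (2 n)); equals 1 - nu / nu_{kn}."""
    return 1.0 - 4 * nu * n * n * math.sin(k * math.pi / (2 * n)) ** 2


# Thresholds for the two smallest cases.
nu_12 = nu_threshold(1, 2)                        # n = 2: the single threshold
nu_13, nu_23 = nu_threshold(1, 3), nu_threshold(2, 3)  # n = 3: upper and lower
```

For n = 3 these evaluate to 1/9 and 1/27, the endpoints that appear throughout Section 3.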

The  simplest  nontrivial  case,  n  =  3  ,  will  be  analyzed  in  Section  3.  In  the  remainder 
of  this  section,  the  effect  upon  Eq.  (2.2)  of  a  particular  change  of  dependent  variables 
will  be  described. 

One of the tools used by Burgers in his study of Eq. (1.8) and similar equations is
the expansion of the solution function in Fourier series; the partial differential
equation yields formally an infinite system of ordinary differential equations for the
determination of the Fourier coefficients as functions of time. At a fixed time, the
coefficients may be called a "spectrum" which describes the harmonic content of
the spatial distribution of "turbulent" fluctuations; the significance of this spectrum
and various questions concerned with its change with time are discussed in Burgers'
papers (Refs. 1 and 2).

To obtain an analogous (but finite) spectrum for the discretized model Eq. (2.2), the
solution $u_k(t)$ may be expanded (with respect to the spatial variable k) in a finite
sine series; in other words, new complex coordinates $\xi_1, \ldots, \xi_{n-1}$ may be
introduced in place of $u_1, \ldots, u_{n-1}$ in the (complex, n - 1 dimensional) phase
space of the system (2.2) by means of the transformation

$$ u_k = \sum_{j=1}^{n-1} \xi_j \sin\frac{jk\pi}{n}, \quad k = 1, 2, \ldots, n-1 \tag{2.8} $$

Abbreviating $\sin(jk\pi/n) = \sigma_{kj}$ and noting the orthogonality relation

$$ \sum_{k=1}^{n-1} \sigma_{ki}\,\sigma_{kj} = \frac{n}{2}\,\delta_{ij} = \begin{cases} n/2, & i = j \\ 0, & i \ne j \end{cases} $$

the inverse transformation is easily found to be

$$ \xi_j = \frac{2}{n} \sum_{k=1}^{n-1} u_k\,\sigma_{kj} \tag{2.9} $$

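Inserting Eq. (2.8) in Eq. (2.2) and performing algebraic reductions lead to the
following system of equations for the $\xi_k$:

$$ \dot\xi_k = (\lambda_k + i)\,\xi_k - \frac{n}{3} \sum_{1 \le j < j+k \le n-1} \alpha_{j,j+k}\,\bar\xi_j\,\bar\xi_{j+k} - \frac{n}{6} \sum_{1 \le j,\,k-j \le n-1} \beta_{j,k-j}\,\bar\xi_j\,\bar\xi_{k-j} \tag{2.10} $$

k = 1, 2, ..., n-1

where

$$ \alpha_{jk} = \sin\frac{j\pi}{n}\left(1 + 2\cos\frac{k\pi}{n}\right) - \sin\frac{k\pi}{n}\left(1 + 2\cos\frac{j\pi}{n}\right) $$

$$ \beta_{jk} = \sin\frac{j\pi}{n}\left(1 + 2\cos\frac{k\pi}{n}\right) + \sin\frac{k\pi}{n}\left(1 + 2\cos\frac{j\pi}{n}\right) $$

$$ \lambda_k = 1 - 4\nu n^2 \sin^2\frac{k\pi}{2n} $$

i.e., the linear part of the system has been diagonalized by the change of coordinates.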
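
The orthogonality relation and the diagonalization of the linear part can be checked directly. The sketch below assumes the transformation pair (2.8) and (2.9) in the form $u_k = \sum_j \xi_j \sin(jk\pi/n)$, $\xi_j = (2/n)\sum_k u_k \sin(jk\pi/n)$; all function names are hypothetical. It treats only the linear part of Eq. (2.2); the quadratic terms of Eq. (2.10) are not exercised here.

```python
import math


def sigma(k, j, n):
    """sigma_{kj} = sin(j k pi / n)."""
    return math.sin(j * k * math.pi / n)


def to_u(xi):
    """Transformation (2.8): spectral coordinates xi_j -> mesh values u_k."""
    n = len(xi) + 1
    return [sum(xi[j - 1] * sigma(k, j, n) for j in range(1, n)) for k in range(1, n)]


def to_xi(u):
    """Inverse transformation (2.9): mesh values u_k -> spectral coordinates xi_j."""
    n = len(u) + 1
    return [(2 / n) * sum(u[k - 1] * sigma(k, j, n) for k in range(1, n)) for j in range(1, n)]


def linear_part(u, nu):
    """Linear terms of Eq. (2.2): nu n^2 (second difference) + (1 + i) u_k."""
    n = len(u) + 1
    full = [0j] + [complex(z) for z in u] + [0j]
    return [nu * n * n * (full[k + 1] - 2 * full[k] + full[k - 1]) + (1 + 1j) * full[k]
            for k in range(1, n)]


def growth_rate(k, n, nu):
    """lambda_k = 1 - 4 nu n^2 sin^2(k pi / (2 n))."""
    return 1.0 - 4 * nu * n * n * math.sin(k * math.pi / (2 * n)) ** 2
```

Applying `to_xi(linear_part(to_u(xi), nu))` should reproduce $(\lambda_j + i)\,\xi_j$ component by component, which is the sense in which the sine coordinates diagonalize the linear part.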
Section 3
THE  CASE  n  =  3 


INTRODUCTION 

If n = 3, Eqs. (2.2), (2.8), (2.9), and (2.10) have the following forms:

$$ \dot u_1 = (1 - 18\nu + i)\,u_1 + 9\nu\,u_2 - (\bar u_1 + \bar u_2)\,\bar u_2 $$
$$ \dot u_2 = 9\nu\,u_1 + (1 - 18\nu + i)\,u_2 + (\bar u_1 + \bar u_2)\,\bar u_1 \tag{3.1} $$

$$ u_1 = \frac{\sqrt{3}}{2}(\xi_1 + \xi_2), \quad u_2 = \frac{\sqrt{3}}{2}(\xi_1 - \xi_2) \tag{3.2} $$

$$ \xi_1 = \frac{1}{\sqrt{3}}(u_1 + u_2), \quad \xi_2 = \frac{1}{\sqrt{3}}(u_1 - u_2) \tag{3.3} $$

$$ \dot\xi_1 = (1 - 9\nu + i)\,\xi_1 + \sqrt{3}\,\bar\xi_1\,\bar\xi_2 $$
$$ \dot\xi_2 = (1 - 27\nu + i)\,\xi_2 - \sqrt{3}\,\bar\xi_1^2 \tag{3.4} $$

The most evident property of the system (3.4) will be stated as Theorem 3.1.

THEOREM 3.1

If $\xi_1(t)$, $\xi_2(t)$ is a solution of Eq. (3.4) in an interval containing a point
$t_0$ such that $\xi_1(t_0) = 0$, then

$$ \xi_1(t) \equiv 0, \quad \xi_2(t) = \xi_2(t_0) \exp[(1 - 27\nu + i)(t - t_0)] \tag{3.5} $$

is the unique extension of the given solution to $-\infty < t < +\infty$.

This theorem implies that the phase point sets $\xi_1 = 0$ and $\xi_1 \ne 0$ are
invariant manifolds; solution trajectories never pass from one to the other. Solutions
of the form Eq. (3.5) will be subsequently referred to as trivial. For $\nu > 1/27$, it is
apparent that all trivial solutions approach the zero solution.
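
The consistency of the n = 3 forms can be checked by evaluating both right-hand sides at the same phase point. The following sketch assumes Eqs. (3.1) through (3.5) in the forms $\dot\xi_1 = (1 - 9\nu + i)\xi_1 + \sqrt{3}\,\bar\xi_1\bar\xi_2$, $\dot\xi_2 = (1 - 27\nu + i)\xi_2 - \sqrt{3}\,\bar\xi_1^2$ and the linear maps $u_{1,2} = (\sqrt{3}/2)(\xi_1 \pm \xi_2)$; the function names are hypothetical.

```python
import cmath
import math

SQRT3 = math.sqrt(3.0)


def rhs_u(u1, u2, nu):
    """Right-hand side of Eq. (3.1) in the mesh variables u_1, u_2."""
    c1, c2 = u1.conjugate(), u2.conjugate()
    du1 = (1 - 18 * nu + 1j) * u1 + 9 * nu * u2 - (c1 + c2) * c2
    du2 = 9 * nu * u1 + (1 - 18 * nu + 1j) * u2 + (c1 + c2) * c1
    return du1, du2


def rhs_xi(x1, x2, nu):
    """Right-hand side of Eq. (3.4) in the spectral variables xi_1, xi_2."""
    c1, c2 = x1.conjugate(), x2.conjugate()
    return ((1 - 9 * nu + 1j) * x1 + SQRT3 * c1 * c2,
            (1 - 27 * nu + 1j) * x2 - SQRT3 * c1 * c1)


def u_from_xi(x1, x2):
    """Eq. (3.2)."""
    return SQRT3 / 2 * (x1 + x2), SQRT3 / 2 * (x1 - x2)


def xi_from_u(u1, u2):
    """Eq. (3.3)."""
    return (u1 + u2) / SQRT3, (u1 - u2) / SQRT3


def trivial_solution(x2_at_t0, nu, t, t0=0.0):
    """Eq. (3.5): xi_1 stays zero, xi_2 evolves by a complex exponential."""
    return 0j, x2_at_t0 * cmath.exp((1 - 27 * nu + 1j) * (t - t0))
```

Mapping a point $\xi$ to $u$ with `u_from_xi`, applying `rhs_u`, and mapping the velocity back with `xi_from_u` should agree with `rhs_xi`. On $\xi_1 = 0$ the first component of `rhs_xi` vanishes, which is the invariance stated in Theorem 3.1.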

Because of the complex notation, Eq. (3.4) is a convenient condensation for a system
of four first-order differential equations for four real-valued functions. For
discussion of the nontrivial solutions, real coordinates will be introduced which permit
the reduction of the initial-value problem for Eq. (3.4) to the initial-value problem for
a system of three real differential equations plus a quadrature.

THEOREM 3.2

The solution of Eq. (3.4) taking initial values $\xi_1(0) \ne 0$, $\xi_2(0)$ is given by

$$ \xi_1(t) = \frac{\xi_1(0)}{|\xi_1(0)|} \left[\frac{z(t)}{3}\right]^{1/2} \exp\left\{ i \int_0^t [1 - y(s)]\,ds \right\} $$

$$ \xi_2(t) = \frac{\bar\xi_1(0)}{\xi_1(0)}\,\frac{1}{\sqrt{3}}\,[x(t) + i\,y(t)] \exp\left\{ -2i \int_0^t [1 - y(s)]\,ds \right\} \tag{3.6} $$

in which x(t), y(t), z(t) is the solution of the system

$$ \dot x = (1 - 27\nu)\,x - 3y - z + 2y^2 \tag{3.7a} $$
$$ \dot y = 3x + (1 - 27\nu)\,y - 2xy \tag{3.7b} $$
$$ \dot z = 2(1 - 9\nu)\,z + 2xz \tag{3.7c} $$

satisfying the initial conditions dictated by the special case t = 0 of the transformation
relations

$$ x(t) + i\,y(t) = \sqrt{3}\,\frac{\xi_1(t)}{\bar\xi_1(t)}\,\xi_2(t), \quad z(t) = 3\,|\xi_1(t)|^2 \tag{3.8} $$

Proof. The proof consists of straightforward verification and will be omitted. In its
place the formal processes leading to Eqs. (3.7) will be sketched. ("Formal" here
refers specifically to the fact that possible discontinuities of the function
$\varphi_2(t)$ defined below are ignored.)

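Into Eqs. (3.4) introduce real polar coordinates by

$$ \xi_k = \rho_k \exp(i\varphi_k), \quad k = 1, 2 \tag{3.9} $$

The following differential equations are found for $\rho_k$, $\varphi_k$:

$$ \dot\rho_1 = (1 - 9\nu)\,\rho_1 + \sqrt{3}\,\rho_1\rho_2 \cos(2\varphi_1 + \varphi_2) $$
$$ \dot\rho_2 = (1 - 27\nu)\,\rho_2 - \sqrt{3}\,\rho_1^2 \cos(2\varphi_1 + \varphi_2) \tag{3.10} $$

$$ \rho_1\,\dot\varphi_1 = \rho_1 - \sqrt{3}\,\rho_1\rho_2 \sin(2\varphi_1 + \varphi_2) $$
$$ \rho_2\,\dot\varphi_2 = \rho_2 + \sqrt{3}\,\rho_1^2 \sin(2\varphi_1 + \varphi_2) \tag{3.11} $$

By defining a new variable $\psi = 2\varphi_1 + \varphi_2$, Eqs. (3.11) yield formally

$$ \dot\psi = 3 - \sqrt{3}\left(2\rho_2 - \frac{\rho_1^2}{\rho_2}\right) \sin\psi \tag{3.12} $$

Now Eqs. (3.10) and (3.12) constitute a third-order system for $\rho_1$, $\rho_2$, $\psi$ as
functions of t. Given a solution of this system the functions $\varphi_1$, $\varphi_2$ can be
found by quadrature from Eqs. (3.11).

The system of Eqs. (3.7) now results from transformation of the system (3.10)
and (3.12) according to

$$ x = \sqrt{3}\,\rho_2 \cos\psi, \quad y = \sqrt{3}\,\rho_2 \sin\psi, \quad z = 3\rho_1^2 $$

The combined transformation from Eqs. (3.4) to (3.7) is given by Eq. (3.8). The first
of Eqs. (3.11) yields

$$ \varphi_1(t) = \varphi_1(0) + \int_0^t [1 - y(s)]\,ds $$

or

$$ \frac{\xi_1(t)}{|\xi_1(t)|} = \frac{\xi_1(0)}{|\xi_1(0)|} \exp\left\{ i \int_0^t [1 - y(s)]\,ds \right\} $$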

The transformation employed can be interpreted so as to yield also a correspondence
between the trivial solutions of Eq. (3.4) and solutions of Eqs. (3.7) with $z(t) \equiv 0$;
simplicity is lost in the process, however, since the restriction of Eq. (3.4) to
$\xi_1 = 0$ leaves a linear system while the restriction of Eqs. (3.7) to z = 0 is
nonlinear.
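
The reduction from Eq. (3.4) to Eqs. (3.7) can be verified pointwise without integrating anything. The sketch below assumes the transformation (3.8) in the form $x + iy = \sqrt{3}\,\xi_1\xi_2/\bar\xi_1$, $z = 3|\xi_1|^2$; the function names are hypothetical. It differentiates the transformation by the chain rule, using the velocities from Eq. (3.4), and compares the result with the right-hand side of Eqs. (3.7) evaluated at the transformed point.

```python
import math

SQRT3 = math.sqrt(3.0)


def rhs_xi(x1, x2, nu):
    """Right-hand side of Eq. (3.4)."""
    c1, c2 = x1.conjugate(), x2.conjugate()
    return ((1 - 9 * nu + 1j) * x1 + SQRT3 * c1 * c2,
            (1 - 27 * nu + 1j) * x2 - SQRT3 * c1 * c1)


def rhs_xyz(x, y, z, nu):
    """Right-hand side of Eqs. (3.7a), (3.7b), (3.7c)."""
    return ((1 - 27 * nu) * x - 3 * y - z + 2 * y * y,
            3 * x + (1 - 27 * nu) * y - 2 * x * y,
            2 * (1 - 9 * nu) * z + 2 * x * z)


def to_xyz(x1, x2):
    """Transformation (3.8); requires xi_1 != 0."""
    w = SQRT3 * x1 * x2 / x1.conjugate()
    return w.real, w.imag, 3 * abs(x1) ** 2


def xyz_velocity_from_xi(x1, x2, nu):
    """Time derivative of (x, y, z) obtained from Eq. (3.4) by the chain rule."""
    d1, d2 = rhs_xi(x1, x2, nu)
    c1 = x1.conjugate()
    dw = SQRT3 * (d1 * x2 / c1 + x1 * d2 / c1 - x1 * x2 * d1.conjugate() / (c1 * c1))
    dz = 6 * (c1 * d1).real
    return dw.real, dw.imag, dz
```

Agreement of `xyz_velocity_from_xi(x1, x2, nu)` with `rhs_xyz(*to_xyz(x1, x2), nu)` at arbitrary points with $\xi_1 \ne 0$ is the "straightforward verification" that the proof omits.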

It is easily seen that in the (invariant) half-space z > 0 the system (3.7)
can have at most one critical point. In fact, setting the right-hand members equal
to zero, Eq. (3.7c) implies

$$ x = x_c = 9\nu - 1 $$

Eq. (3.7b) then requires

$$ y = y_c = \frac{1 - 9\nu}{1 - 15\nu} $$

and Eq. (3.7a) requires

$$ z = z_c = (1 - 9\nu)(27\nu - 1)\left[1 + (1 - 15\nu)^{-2}\right] $$

This point $(x_c, y_c, z_c)$ belongs to the half-space z > 0 if $\nu$ lies in either of
the intervals $1/27 < \nu < 1/15$ or $1/15 < \nu < 1/9$. That the first interval represents
values of $\nu$ too small to yield "interesting" results in the case n = 3 is suggested by
the following negative result.


THEOREM 3.3

If $1/27 < \nu < 1/15$, every solution of Eqs. (3.7) which intersects the region z > 0,
y < 3/2 is unbounded as $t \to +\infty$.

Proof. Using Eqs. (3.7), it is readily verified that for any solution

$$ \frac{d}{dt}\left[ z\left(y - \tfrac{3}{2}\right) e^{-3(1 - 15\nu)t} \right] = \tfrac{3}{2}(1 - 27\nu)\,z\,e^{-3(1 - 15\nu)t} $$

For a nontrivial solution (z(t) > 0) and $\nu > 1/27$, this implies

$$ \frac{d}{dt}\left[ z\left(y - \tfrac{3}{2}\right) e^{-3(1 - 15\nu)t} \right] < 0 $$

Hence for $t_1 < t_2$

$$ z(t_2)\left(y(t_2) - \tfrac{3}{2}\right) < z(t_1)\left(y(t_1) - \tfrac{3}{2}\right) e^{-3(1 - 15\nu)(t_1 - t_2)} $$

If $y(t_1) - 3/2 < 0$, $\nu < 1/15$, and z(t) is bounded, this inequality implies
$y(t_2) \to -\infty$ as $t_2 \to \infty$. Under the hypothesis of the theorem, therefore, at least
one of the functions z(t), y(t) is not bounded.
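
Both the critical point and the identity used in this proof can be confirmed in exact rational arithmetic. The sketch below assumes the forms of Eqs. (3.7) and of $(x_c, y_c, z_c)$ given above; the function names are hypothetical. The identity is checked in the equivalent form $\frac{d}{dt}[z(y - 3/2)] - 3(1 - 15\nu)\,z(y - 3/2) = \frac{3}{2}(1 - 27\nu)\,z$, which avoids the exponential factor.

```python
from fractions import Fraction as F


def rhs_xyz(x, y, z, nu):
    """Right-hand side of Eqs. (3.7a), (3.7b), (3.7c)."""
    return ((1 - 27 * nu) * x - 3 * y - z + 2 * y * y,
            3 * x + (1 - 27 * nu) * y - 2 * x * y,
            2 * (1 - 9 * nu) * z + 2 * x * z)


def critical_point(nu):
    """Critical point (x_c, y_c, z_c) of Eqs. (3.7); requires nu != 1/15."""
    xc = 9 * nu - 1
    yc = (1 - 9 * nu) / (1 - 15 * nu)
    zc = (1 - 9 * nu) * (27 * nu - 1) * (1 + 1 / (1 - 15 * nu) ** 2)
    return xc, yc, zc


def identity_gap(x, y, z, nu):
    """d/dt[z (y - 3/2)] - 3 (1 - 15 nu) z (y - 3/2) - (3/2)(1 - 27 nu) z along (3.7)."""
    _, dy, dz = rhs_xyz(x, y, z, nu)
    lhs = dz * (y - F(3, 2)) + z * dy
    return lhs - 3 * (1 - 15 * nu) * z * (y - F(3, 2)) - F(3, 2) * (1 - 27 * nu) * z
```

With `Fraction` inputs, `rhs_xyz(*critical_point(nu), nu)` should be exactly zero in every component and `identity_gap` should vanish at every point, not only on solution curves, since the identity is algebraic once the velocities are substituted.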


Against this negative result for $1/27 < \nu < 1/15$, the theorem following reveals
interesting behavior for solutions of Eqs. (3.7) for at least a part of the interval
$1/15 < \nu < 1/9$.

THEOREM 3.4

There is a number $\nu'$, $1/15 \le \nu' < 0.0836 < 1/9$, such that for $\nu' < \nu < 1/9$ every
nontrivial solution of the system (3.7) approaches the equilibrium solution
$(x_c, y_c, z_c)$ as $t \to \infty$; Eqs. (3.6) then imply that every nontrivial solution of
Eqs. (3.4) approaches a periodic solution.

Proof. The proof depends on the following properties of the function

$$ V(x, y, z) = (x - x_c)^2 + (y - y_c)^2 + (z - z_c) - z_c \log\frac{z}{z_c} \tag{3.13} $$

defined in the half-space z > 0, $-\infty < x, y < +\infty$.

• $V \ge 0$; V = 0 if and only if $x = x_c$, $y = y_c$, $z = z_c$.

• $V \to \infty$ if either $z \to 0$ or $x^2 + y^2 \to \infty$ in z > 0.

• If $0 < \alpha < \infty$, the set $E(\alpha) = \{(x, y, z) \mid V(x, y, z) \le \alpha\}$ is a closed and
bounded subset of z > 0 and $E(\alpha) \subset E(\beta)$ if $\alpha < \beta$.

• Along a solution curve of the system (3.7),

$$ -\frac{1}{2}\frac{dV}{dt} = (2 + 3x_c)(x - x_c)^2 - 2y_c (x - x_c)(y - y_c) + (2 + 5x_c)(y - y_c)^2 \tag{3.14} $$

The right-hand member of this relation is a positive definite quadratic form
in the variables $(x - x_c)$, $(y - y_c)$ provided that $\nu' < \nu < 1/9$, where
$\nu' \approx 0.0836$ is a real root of the discriminant (which is a rational
function of $\nu$). The implication of Eq. (3.14) needed in the proof is the
inequality

$$ \frac{dV}{dt} \le -2B\left[(x - x_c)^2 + (y - y_c)^2\right] \tag{3.15} $$

where

$$ B = 2(1 + 2x_c) - (x_c^2 + y_c^2)^{1/2} $$

is the positive smallest eigenvalue of the form.

These properties are either obvious or can be proved by direct calculation.

If x(t), y(t), z(t) is the solution of Eqs. (3.7) with initial values $x_0$, $y_0$,
$z_0 > 0$ and if $\alpha_0 = V(x_0, y_0, z_0)$, the phase point (x(t), y(t), z(t)) belongs to
$E(\alpha_0)$ for $t \ge 0$ since V = V[x(t), y(t), z(t)] is a nonincreasing function of t by
Eq. (3.15). Thus, the phase point is bounded; from Eqs. (3.7), it follows that the
phase-point velocity $(\dot x^2 + \dot y^2 + \dot z^2)^{1/2}$ is also uniformly bounded for
$t \ge 0$.

It will be shown first that $x(t) \to x_c$ and $y(t) \to y_c$ as $t \to \infty$, or,
equivalently, that no point of the limit set of a solution lies off the half-line
z > 0, $(x - x_c)^2 + (y - y_c)^2 = 0$.

Assuming the contrary, there is a point P of the limit set, a neighborhood, N(P), of
P, and an $\epsilon > 0$ such that N(P) does not intersect the cylinder
$(x - x_c)^2 + (y - y_c)^2 \le \epsilon$. During the time intervals that the phase point
lies in N(P), Eq. (3.15) implies $dV/dt < -2B\epsilon$. Since V is nonnegative and
nonincreasing, this inequality implies that the time which a phase point spends in N(P)
is bounded. The boundedness of the phase point velocity and the assumption that P
belongs to the limit set imply, however, that the time spent in N(P) is unbounded. The
contradiction proves that the limit set lies on the half-line and hence that
$x(t) \to x_c$ and $y(t) \to y_c$.

It remains to prove that lim z(t) exists and equals $z_c$. Along a trajectory, V
decreases and thus has a limit. From Eq. (3.13) and the existence of limits for
V, x, and y, there follows the existence of lim $(z - z_c \log z)$ and, hence, that of
lim z. From Eq. (3.7a), the existence of lim $\dot x$ is then immediate. But if lim x
and lim $\dot x$ exist, it follows at once that lim $\dot x = 0$. Hence, the right-hand
member of Eq. (3.7a) converges to zero as $t \to \infty$, which implies that lim z = $z_c$ and
concludes the proof.
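
The behavior asserted by Theorem 3.4 can be illustrated numerically. The following is a minimal sketch, not part of the proof: it integrates Eqs. (3.7) with a classical fourth-order Runge-Kutta step and records the function V of Eq. (3.13) along the computed trajectory. The value $\nu = 0.1$, the step size, the initial point, and all function names are illustrative assumptions.

```python
import math


def rhs(p, nu):
    """Right-hand side of Eqs. (3.7a), (3.7b), (3.7c) at the phase point p = (x, y, z)."""
    x, y, z = p
    return ((1 - 27 * nu) * x - 3 * y - z + 2 * y * y,
            3 * x + (1 - 27 * nu) * y - 2 * x * y,
            2 * (1 - 9 * nu) * z + 2 * x * z)


def critical_point(nu):
    """Critical point (x_c, y_c, z_c) in z > 0; requires nu != 1/15."""
    return (9 * nu - 1,
            (1 - 9 * nu) / (1 - 15 * nu),
            (1 - 9 * nu) * (27 * nu - 1) * (1 + 1 / (1 - 15 * nu) ** 2))


def lyapunov(p, nu):
    """The function V of Eq. (3.13); requires z > 0."""
    x, y, z = p
    xc, yc, zc = critical_point(nu)
    return (x - xc) ** 2 + (y - yc) ** 2 + (z - zc) - zc * math.log(z / zc)


def rk4_step(p, nu, h):
    """One classical Runge-Kutta step of size h."""
    def shifted(base, slope, s):
        return tuple(b + s * k for b, k in zip(base, slope))
    k1 = rhs(p, nu)
    k2 = rhs(shifted(p, k1, h / 2), nu)
    k3 = rhs(shifted(p, k2, h / 2), nu)
    k4 = rhs(shifted(p, k3, h), nu)
    return tuple(pi + h / 6 * (a + 2 * b + 2 * c + d)
                 for pi, a, b, c, d in zip(p, k1, k2, k3, k4))


def integrate(p0, nu, t_end, h=0.01):
    """Integrate from p0 over [0, t_end]; return the final point and the V history."""
    p = tuple(p0)
    history = [lyapunov(p, nu)]
    for _ in range(int(round(t_end / h))):
        p = rk4_step(p, nu, h)
        history.append(lyapunov(p, nu))
    return p, history
```

For example, `integrate((0.5, 0.5, 1.0), 0.1, 200.0)` would be expected to end close to `critical_point(0.1)`, with the recorded values of V decreasing toward zero up to discretization error. Repeating the run with $\nu$ in $1/27 < \nu < 1/15$ is one way to observe the contrasting behavior described by Theorem 3.3.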